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Electromagnetic Propagation in Homogeneous 
Media With Hermitian Permeability and 
Permittivity 


By E. E. BERGMANN 


(Manuscript received November 5, 1981) 


The problem of electromagnetic radiation traveling in a general 
homogeneous, but anisotropic and gyrotropic, medium has been 
solved. The plane wave representation is used to convert Maxwell’s 
equations into a general eigenproblem, which allows for tensor 
permeability and permittivity and for electric and magnetic gyro- 
tropy. This formulation can be applied directly to computer evalua- 
tion of wave velocities and polarizations for a given wave-normal 
direction. The reflection and refraction directions and amplitudes at 
an interface between two different, general anisotropic and gyrotropic 
media are solved. It is outlined how, with the aid of a digital 
computer, ray tracing through a general succession of anisotropic 
and gyrotropic, but locally homogeneous, media can be carried out. 


|. INTRODUCTION 


Traditionally, the theory of light propagation in transparent, aniso- 
tropic media has assumed that the permeability of the medium was 
the same as that of the vacuum, or at least that it was isotropic. 
Conversely, the theory of microwave propagation in ferrites and gar- 
nets has assumed that the permittivity was isotropic in spite of the 
permeability being anisotropic and possibly, gyrotropic. (References 1 
and 2 are examples of the optics tradition and the microwave tradition, 
respectively.) 

Both viewpoints are incomplete, particularly, when applied to in- 
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frared optics where anisotropy effects in both permeability and per- 
mittivity could be important. As a case in point, Yttrium Iron Garnet 
(yIG) is optically transparent at infrared wavelengths beyond 1 pm* 
and at microwave frequencies; it could be used at wavelengths where 
the effects of permeability and permittivity are comparable. 

We develop the theory of electromagnetic wave propagation in a 
homogeneous medium where both the permeability and permittivity 
are anisctropic. Furthermore, by assuming that the permittivity and 
permeability tensors are Hermitian, not merely real and symmetric, 
we incorporate the more general effects of gyrotropy. 


ll. BASIC ASSUMPTIONS 


We assume the general validity of Maxwell’s equations without 
“source terms” (no charges or currents): 


V x H= @D/at (1) 
V xX E= -0B/at (2) 
V-D=0 (3) 
V-B=0, (4) 


where E, D, B, H are complex vector fields which are functions of 
space and time (r, ¢). 

We shall restrict our considerations to plane-wave solutions where 
the above four vector fields can be cast in the form 


F(r, ¢) = Foexp[z(k-r — wf) ], (5) 


where Fo is a complex vector constant and w is a real constant. The 
vector, k, is usually considered to be real, but may be complex even in 
lossless media (for “evanescent waves’). 

To complete the specifications, we need to define material-depend- 
ent relationships between the four vector fields of Maxwell’s equations. 
We do so by means of the electromagnetic energy density: 


U = ‘2E*-D + %H*-B. (6) 


For a plane wave, we can substitute (5) into (6) to show that U is 
independent of time and position (assuming k is real): 


U = 'E¢ -Do + %4HG - Bo. (7) 


By assuming that U is a purely quadratic function of the components 
of Eg, we obtain 


Do = €Eo. (8) 


The Hermiticity of €, the permittivity or dielectric tensor, follows from 
the reality of U. 
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Similarly, by assuming that U is a purely quadratic function of the 
components of Ho, we conclude that 


Bo = fi Ho, (9) 
where the permeability tensor, ji, is Hermitian. 


Our assumptions are consistent with magneto-optic effects where, 
conventionally, a Polder tensor‘ is defined, e.g., 


wh Wa O 
p={ wa p O }. (10) 
0 oO yp 


Using the plane-wave representation (5) in Maxwell’s first pair of 
equations, (1) and (2), we obtain 


k xX H = -oD. (11) 
k X E= wB. (12) 
Similar substitutions into Maxwell’s last pair of equations, (3) and 
4), 
| k-D=0 (13) 
k-B=0, (14) 
which, in fact, follow from (11) and (12) when w + 0. 


lll. THE WAVE VELOCITY 


We rewrite the plane-wave representation (5) in a suggestive form 
F = Foexp[—(Imk) -r]-exp[z|Re k|(”-r — vut)], (15) 


where 
n= Rek/|Rek| (16) 


is a unit vector that is normal to the wave fronts, and the wave velocity 
Vw = w/|Rek|. (17) 


Thus, v, represents the rate at which the wave fronts appear to be 
advancing in the n direction. 

We usually assume that Im k = 0. However, for Im k ¥ 0 we have 
an “evanescent wave”; a wave whose intensity diminishes in the Im k 
direction. It will be shown later, under the discussion of Poynting’s 
vector, that Im k is perpendicular to Poynting’s vector in nondissipa- 
tive media. 

Let us assume now that Im k = 0 and that, consequently, 71 is 
parallel to k. We can restate (25) and (26) as follows: 


n XH =-v,D = —-v,,éE (18) 
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nX E=v.B = v,jiHH. (19) 


If we go to a six-dimensional representation, we may combine (18) 
and (19) into a single-matrix equation: 


0 0 0 0 nz —Ny E, 
0 0 0 -A2 O fr Ey 
0 0 0 hy —h, 0 E. 
0 —-n, ny 0 0 0 H, 
Az O -rx O 0 0 H, 
—ny ny O0 0 0 0 H; 

€xx  Exy Exz O 0 O . 

&x Gy & O O O Ey 

- €xx Gy & 0 OO O Ez 

poe 0 0 0 Pixx Pxy Uxz Hi, ey 
0 0 DO bye My Hye H, 
0 0 0 Pex Pz Pz H, 


For convenience, we shall compress (20) by defining a more compact 


notation 
0 —-N\/(E é 0\/E 
(i “o)(a)-~(0 Qe) 


av = vy Bp. (22) 


Note that (22) is clearly a general eigenproblem. The tensor, 8, is 
determined by the permeability and permittivity of the medium. If k 
is real, then @is determined completely by 7. 

Both &@ and f are usually Hermitian. Tensor, &, is symmetric. It is 
real if k was real. We can restate (6) as 


2U = v*- Bp. (23) 


Away from dramatic resonances, we may assume that U is positive 
definite as well as real. If so, B is not only Hermitian but positive 
definite. 

When £ is both Hermitian and positive definite, we can find its 
inverse and its square root (all of which are Hermitian and positive 
definite as well). Thus, we are able to convert (8) to an ordinary 
eigenproblem of a Hermitian matrix: 


(BEB?) (Br) = vl Br). (24) 


or 


The Hermiticity of (8~/?&8~'”) guarantees that there are six eigen- 
solutions, each with a real v,,. 
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We can use the Hermiticity of & and B to prove orthogonality: 
vi Sp. = vir vt «Bye = vuov¥ «Bre. (25) 


Clearly, when vi; does not equal v2, the corresponding eigenvectors 
satisfy the orthogonality relations: 


vi Gv. = 0 = v¥- Bro. (26) 


Additional orthogonality relationships can be proved if vin ¥ vine 
(1 =1,2,3.---): 


vi -B( BS)" v2 = 0. (27) 


Two eigensolutions can be provided automatically: 


Vs = ) and pvg= (*) 


Vuws =O and vVus=0. (28) 


Thus, we have to find four more eigensolutions to this six-dimensional 
eigenproblem. 

The reality of 7 and the special form of 8 enable us to simplify the 
problem still further. Suppose we have a solution to (21). It follows 


then that 
0 -N\/ E ' (e0\/f E 
(0 )(-a)=—(0 i) 


Thus, for each eigensolution with wave velocity vw, there is an analo- 
gous solution that travels in the opposite direction. 

In summary, of the six solutions to the eigenproblem (20) for a 
specified, real 7, two are trivial and have zero wave velocity, two are 
positive and correspond to two polarized solutions propagating in the 
+n direction, and the last two are negative, and, consequently, travel 

in the —7n direction. 

- It is worthwhile noting that we have included nonreciprocal cases, 
since Faraday rotation, the traditional nonreciprocal effect, is mani- 
fested by complex, yet Hermitian, permittivity and/or permeability 
tensors. These effects do not seem to cause the wave velocities in the 
—n direction to differ from those in the +7 direction. The nonreciprocal 
effects must be manifest in the differing polarizations between the 
forward and the backward directions. 

Polarization is specified usually by the orientation of the electro- 
magnetic field components normal to the relevant velocity direction 
(here n). Thus, from eqs. (13) and (14) we should concentrate upon D, 
when k is real. In general, the components of D, say, are complex; then 
the polarization should be considered elliptical. The special case of 
D « D* is called “linear polarization.” Since D is confined to the plane 
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perpendicular to k, its description for a specified 7 can be reduced 
always to two complex vector components; this two-dimensional rep- 
resentation is known as the Jones vector.”® 

Nonreciprocity is the inequivalence of wave velocity (or attenuation, 
which we ignore here) for a given polarization for the +7 directions. 
The reason that Faraday rotation is nonreciprocal is that the polari- 
zation for a given magnitude of wave velocity in the +7 directions are 
not the same; the Jones vector representation for the +7 directions 
associate right-handed elliptical polarization to left-hand elliptical 
polarization in the opposite direction. Two polarizations in the +’ 
directions would be considered the same if the corresponding D fields 
obeyed the proportionality 


D, o De. (30) 


Thus, nonreciprocity for solutions consistent with (20) could occur 
only when D* is not proportional to D; no nonreciprocity occurs for 
modes that are linearly polarized. It also appears to be a consequence 
of the form of (20) that no polarization-independent nonreciprocity 
can be produced in homogeneous media. However, if the medium is 
bianisotropic, then polarization-independent nonreciprocity is possible 
(see Ref. 7). 

The orthogonality relations (26) and (27) derived earlier, can be 
used to show that the two polarizations associated with the two 
different wave velocities in a given 7n-direction are “perpendicular.” 
Let us call these solutions, “fast” and “slow.” By assuming that w 4 0, 
we can derive from the orthogonality relation 


E fast * Datow 7 Hést . Batow = 0. (31) 


Since East, —Heast is again a solution with yet another v,,, we can derive 
from orthogonality 


Efist* Dstow — Hfast: Batow = 0. (32) 

Thus, we can conclude from (31) and (32) that 
Efist* Datow = 0 (33) 
Hist: Baow = 0. (34) 


We see from (31) also that there is no interference in the energy 
density between the fast and slow solutions for a given direction, 7. 
Suppose at some location 


VP = APtast + OVaow, (35) 
then 
U= Yop. Bp = ¥| al? rhst- BP rast + | b|?v%iow* BPstow- (36) 
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Usually, the literature considers the more restrictive cases,® where 
either jf or € is isotropic and conclude 


Dist: Datow = 0(?) 3 
which is not always true for more general cases such as are considered 
here. 
IV. THE POYNTING VECTOR 


If we consider the time dependence of U, we derive: 


dU 1dpr*B gt dBv Re( +7) 
= = py — |. 


dé 2 dt 2° at 





pi a cl er (37) 


By using Maxwell’s eqs. (1) and (2), we obtain further: 
dU/dt = Re(E*-V x H — H*-V x E) = V-[Re(E* x H)]. (38) 


As we have identified dU/dt with the divergence of a “flux,” this flux 
should be identified with Poynting’s vector. With this in mind, we 
make two definitions 


G=kE* xH (39) 
and 
S=ReG. (40) 


For a plane wave of real k, G is independent of time and space. 
However, it may be complex. The real part of G, namely, S, is 
Poynting’s vector and is always real, obviously. 

The vectors G and G* are useful because they are “dual” to k. 
Consider 


G x D = H(E*-D) — E*(H-D) 
= H(E*-D) + w ‘E*(H-k x H) = UH. 
Consequently, 
G x D= UH. (41) 
A similar derivation shows 
G* x B= —UE. (42) 


These two equations are analogous to (12) and (11) and suggest a 
series of steps that are usable for calculating the ray velocity. 

The ray velocity vector, v,, which is parallel to the Poynting vector, 
S, is defined as 


v, = S8/U (43a) 
vr = |v,|. (43b) 
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The Appendix proves that the Poynting vector is indeed the ray 
direction. 
Unlike the theory for k, we will not be able to assume that G is real. 


Nevertheless, we define 


£=G/|ReG| (44) 
and 
§=Reg=S/|S|. (45) 
With these notational conventions, we restate (5) and (6) 
é* x B=—-v,'E=-—v,;'é"D (46) 
éx D=v;'H=v,'fi'B. (47) 


These two relations can be rewritten as an eigencondition: 


0 o oO 0 gi —-é} D» 
0 0 0 —8 z 0 é x Dy, 
0 0 oO gf -gF 0 D: 
0 -& & 0 0 0 B: 
& 0 -& 0 0 0 By 
—By é x 0 0 0 0 B. 


D; 
D, 
=viip | o ). 8) 
By 
B. 
And a terser notation, analogous to (22), could be 
I'v = v;'27v, (49) 


where I’ is Hermitian; 2 is positive definite and Hermitian because 


2U = v*-Bv (50) 


in analogy to (23). 

In accordance with the analogous discussion of the solutions (22), 
the six real eigenvalue solutions to (49) will consist of two zero v;", 
two positive v;', and their two negatives. Similar orthogonality rela- 
tions can also be generated. 

The duality® would be complete if ¢ were equal to §. If k and # were 
real, we could expect that G = S. However, we have demonstrated by 
computer that a real k does not always lead to a real G. 

Thus, we must conclude that to solve for v; starting with a given §, 
we must find the appropriate values of Im ¢ that produce a real k in 


942 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1982 


a test such as 
Uk = w(D* x B). (51) 


We shall show now that if k is complex, then its imaginary part is 
perpendicular to S. From (11) and (12), we can generate 


—E*-k x H = wk*-D (52) 
H*-k X E= wH*-B. (53) 

Adding (52) and (53), we obtain 
k-(E* X H+ E x H*) = wo(E*-D + H*-B). (54) 


Using the definitions of Poynting’s vector and energy density, we can 
restate (54) as 


k-S = wl. (55) 


Because w, U, and S are always assumed to be real, we can conclude 
that 


S-Imk=0. (56) 


V. REFLECTION AND REFRACTION 


We can find the reflection and refraction of a plane wave at a flat 
interface between two different, homogeneous media of the general 
type we have been considering. 

Figure 1 indicates what we should expect at the interface between 
two media. The kp is the propagation vector of the incident wave. 
Usually, there will be two reflected waves, represented by ki and ko, 
and two refracted (transmitted) waves, represented by ks and ky. 
Details of the four “scattered” waves are determined by boundary 
conditions. 

To determine kj, ke, ks, and ky, for a particular ko, the boundary 
conditions we need to consider are the equality of the w(k) associated 
with all five k vectors. In addition, because the geometry possesses 
two-dimensional translational symmetry in the plane of the interface, 
we must match the components in the plane of the interface for all the 
k vectors. These constraints are sufficient to determine the four 
“scattered” k vectors for a given Ko. 

Finding the scattered k vectors amounts to specifying the compo- 
nents of y = k/w that are in the plane of the interface, y,, and solving 
an eigenproblem that provides 


yi! = w/|k-£.|. (57) 


We define x,, for notational convenience, as a unit vector perpendicular 
to the interface and directed in the same sense as the incident radiation. 
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(REFLECTION) 


(REFRACTION) 





(INCIDENT) 


€4 4 €2 2 
INTERFACE 


Fig. 1—Scattering at an interface between two anisotropic media. 


From (11) and (12), we have 


-yx H=€E (58) 
yX E=j7H. (59) 
Rearranging, we have 
-y.f.xXH=¢E+y,xH (60) 
y.t. XE = fH — y, x EB. (61) 


Thus, we have a general eigenproblem again: 


0 oO 0 oO ££, -%:) 
0 0 0 —KX.2 0 Ce 
0 oO 0 #, -&. 0 | 
0 —KX.2 Ky 0 0 0 
Vics 0 —Xix 0 0 0 
—KXiy Lig 0 0 0 0 
0 0 0 0 -Y Ye 
0 0 0 Yiz2 OO Ye 
<i 0 0 0 -Yir Yr 0 
=e a v 
s B 0 Viz Yi 9 0 0 
—yz 0 Yr 0 0 0 
Yly —Yz 0 0 0 0 
=a.v=y.'B.p. (62) 
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This eigenproblem has a lot of similarity to (20) in that the matrices 
on each side of the equation are Hermitian. Further, we expect two 
extraneous solutions for which the eigenvalues yz’ = 0 and the eigen- 


vectors 
bs = @. and p= i) (63) 


are analogous to (28). We expect two “reflected” solutions with nega- 
tive 
s. = £.-8. (64) 


The sign of y. is not so relevant because k and S are rarely parallel in 
anisotropic media. 

We expect a maximum of two “refracted” solutions when we use @2 
and jf2 in (62) and look for solutions with positive $.. We shall have 
less than the maximum number of refracted solutions when y is getting 
large enough so that #. ceases to be positive definite. 

If 8. is neither positive definite nor negative definite, a derivation 
similar to (24) does not exist and some of the solutions for yz’ may be 
complex. If y. is complex, then the corresponding E and H fields 
satisfy from (26): 


v* Gv =0. (65) 
Therefore, we have an evanescent solution: 
§.=0. (66) 


Note that the sign of y. is not the criterion for acceptable solutions 
to (62), rather the sign of $. is the criterion. 


VI. SCATTERING AMPLITUDES 


The solution of (62) has provided the values of ki, ke, k3, and ky for 
a given ko. Let us label the corresponding eigenvectors: 1, v2, v3, V4, 
and ». We shall assume no particular normalization for the eigenvec- 
tors, but suppose for a given vo there will be particular amplitudes a;v; 
for each scattered wave. Naturally, the solutions for the a; are depend- 
ent upon the normalizations of the »;. 

From Maxwell’s eqs. (1) and (2), we must satisfy continuity of the 
components of E and H in the plane of the interface. These conditions 
may be restated as 


@.V0 = &.(—a1P1 — Agv2 + A3v3 + A4V4). (67) 


Equation (67) can be restated as four simultaneous equations by 
suitable definitions. In the following, assume the index, 2, takes on the 


ELECTROMAGNETIC PROPAGATION 945 


integer values 1 through 4. Let 


b; = vi -A.v (68) 
Cij = Vi -OLD;. (69) 
Then it follows that 
b= y Ci Ay. | (70) 
j=l 


The form of the Hermitian, 4 x 4 matrix, (c;;), is simplified somewhat 
by orthogonality relations: 


Cig = Cor = C34 = Cag = 0. " (71) 


Thus, by solving (70), we obtain the four coupling coefficients, a;. 


Vil. RAY TRACING 


We envision that our results will find general applicability in prob- 
lems where “ray-tracing” is to be performed through anisotropic/ 
gyrotropic media and where the relative attenuation of alternative 
paths is to be explored. We shall indicate how to use the preceding 
results to this end. 

As indicated in the discussion following equation (50), there are 
difficulties in working directly with the Poynting vector, S. Even so it 
is the more macroscopic variable as it is the ray direction, as opposed 
to k which depends upon the wavefront normal. Most of the difficulty 
is resolved if we make the assumption (which does not seem too 
restrictive in practice) that the initial medium of the ray is to be 
isotropic (such as air, glass, index matching fluid, etc.). 

For isotropic media, the k direction is identical with the ray direc- 
tion. It is relatively straightforward to calculate y, at the first interface 
when the initial medium is isotropic. By solving (62) and, subsequently, 
calculating the corresponding 8; and, if desired, the amplitudes, a:, we 
can decide which of the scattered ray(s) we may wish to trace further. 

After we have chosen which scattered wave to follow, we use the 
Poynting vector to trace the ray to the next interface. We know the k 
of this ray and are not faced with the awkward task of reconstructing 
it from other information. If we are interested in exact phase relation- 
ships, these are easy to obtain: 


p(second interface) = exp[i(k- Ar) ]»(first interface) . (72) 


The spatial displacement from the first to the second interface has 
been designated Ar. 

We can calculate the y, at the second interface and proceed to solve 
(62). We calculate the new §S; of the scattered waves, etc., and choose 
which one we will follow to the next interface. This process can be 


946 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1982 


iterated as often as necessary to trace the ray of interest through an 
arbitrary optical system made up of homogeneous media and inter- 
faces. 


Vill. SUMMARY AND CONCLUSIONS 


We have developed a general formalism suitable for computation in 
the analysis of optical propagation through a succession of anisotropic, 
gyrotropic, homogeneous media. Such problems arise in a large variety 
of optical devices that are made of materials such as vic, LiNbOs, 
rutile, and calcite. 
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APPENDIX 


Equivalence of Ray Direction and §. 


We shall develop a first-order perturbation theory of the general 
eigenproblem. The result is directly applicable to the calculation of 
group velocity. When the permeability and permittivity are assumed 
to be frequency independent, the group velocity must be parallel to 
the ray velocity. 

Suppose we wish to perturb the eigenproblem: 


Av; = NBvi, (73) 
where Aand Bare Hermitian. In addition, Bis positive definite so that 


a complete set of eigensolutions with real A; is expected. 
Our perturbation assumption is that the replacement of 


A>A+c«A’ (74) 

requires the corresponding substitutions: 
Nim A+ APH + (75) 
er ae (76) 


Therefore, 


(A+ eA’)(vitevit---)=A;tAl+---)Blvit evit---). (77) 


Expanding and collecting terms with common powers of €, we obtain 
for €° the original (73); we obtain for e': 


Avi + A’v; =A:Bv! + NBv;. (78) 
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Taking the dot product with vi and cancelling terms: 


vi-A’v; = dvi By;. (79) 
Therefore, we can calculate A; from 
Ae: 
N= nee (80) 
vi. By; 


The group velocity is calculated by differentiating w(k) with respect 
to the components of k.’° The w dependence is determined from an 
eigenproblem of the form: 


0 0 0 OO k -k 
0 0 0 —k, 0 kh. 
—k, 0 

ie ek, by . 0 0 poem. en 
kz 0 —k, 0 0 0 
_ k, O 0 0 0 


Thus, for example, the x-component of the group velocity, vz: is 
determined from (80) and (81): 


00 00 0 0 
00 00 0 1 
de 00 00 -1 0 
00-10 00 
01 00 0 0 
= (E3H, — EFH, — HSE, + HiE,)/2U =S8,/U. (82) 
Thus, we have demonstrated: 
Ss 
=< >= re 83 
Veg U V. (83) 
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A Primal Algorithm for Finding Minimum-Cost 
Flows in Capacitated Networks 
With Applications 
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Algorithms for finding a minimum-cost, single-commodity flow in 
a capacitated network are based on variants of the simplex method 
of linear programming. We describe an implementation of a primal 
algorithm which is fast and can solve large problems. The major 
ideas incorporated are (i) the sparsity of the network is used to reduce 
the time and computer storage space requirements; (ii) basic solutions 
are stored compactly as spanning trees of the network; (iii) a candi- 
date stack is used to allow flexible strategies in choosing an arc to 
enter the basis tree; (iv) the predecessor and thread data structures 
are used to efficiently traverse the tree and to update the solution at 
each iteration; (v) rules are implemented to avoid cycling or stalling 
caused by degeneracy; and (vi) piecewise-linear, convex arc costs are 
handled implicitly. The Primal Network Flow Convex (PNFC) code 
implements this algorithm and three examples, from communication 
networks, that can be solved with PNFC are discussed: (i) solving the 
area transfer problem; (ii) scheduling the collection of traffic data 
records; and (ili) planning the placement of pair-gain systems. 


l. INTRODUCTION 


This paper describes an implementation of the network simplex 
method for solving the transshipment problem. The problem consists 
of finding a minimum-cost, single-commodity flow in a capacitated 
network satisfying supply-and-demand constraints. The familiar trans- 
portation, assignment, and shortest route problems are all related to 
the transshipment problem. The area of network-flow theory has been 
well studied and has proven useful in many areas of applications. (See 
Refs. 1 to 7.) 
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A number of different methods have been proposed over the years 
to solve network-flow problems. These include primal, dual, primal- 
dual, out-of-kilter, negative-cycle, and scaling methods. A survey of 
computer codes is presented by Charnes et al.® Until recently, primal- 
dual approaches were believed to be superior to the others. However, 
much work has been done involving data structures to reduce the 
computation time and space requirements for primal-network flow 
codes. The primal approach is a specialization of the bounded-variable, 
primal-simplex algorithm of linear programming which takes advan- 
tage of the special structure of network problems. We describe an 
implementation of a primal algorithm that is fast and can solve large 
problems. The major ideas used are as follows: 

(t) The sparsity of the network is used to reduce the time and 
space requirements for solving network-flow problems. 

(iz) Basic solutions are stored compactly as spanning trees of the 
network. 

(tit) A candidate stack is used to allow flexible strategies in choosing 
an arc to enter the tree. 

(tv) The predecessor and thread data structures are used to effi- 
ciently traverse the spanning tree and to update the solution at each 
iteration. 

(v) Rules by Cunningham®”* are implemented to avoid cycling or 
stalling caused by degeneracy. 

(vi) Piecewise-linear, convex arc costs (or, equivalently, multiple- 
parallel arcs) are handled implicitly. 

The Primal Network Flow Convex (PNFC) code is an implementation 
of this algorithm and is described in more detail by Seery.’’” An 
overview flowchart of the PNFc code is shown in Fig. 1. We will only 
discuss the INITIAL and SOLVE modules, which perform analogous 
functions to Phases I and II of the simplex method of linear program- 
ming. 

The problem of finding a minimum-cost flow in a network with 
linear costs is described in detail in Section II. The primal network 
approach is then outlined in Section III. We describe the data struc- 
tures used to implement the algorithm in Section IV. The major steps 
of determining an initial solution, finding an entering arc and leaving 
arc, and exchanging these arcs are detailed in Sections V, VI, VII, and 
VII, respectively. Extensions to other than linear arc costs are de- 
scribed in Section IX. Finally, in Section X, we describe three problems 
in communication networks that can be modeled as network-flow 
problems: 

(z) Solving the area transfer problem. 

(ti) Scheduling the collection of traffic data records. 
(tit) Planning the placements of pair-gain devices. 
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Fig. 1—Overview of the PNFC program. 


il. PROBLEM FORMULATION 


We refer the reader to Harary” for graph-theoretic concepts neces- 
sary for defining the problem; however, we do define a few of these 
concepts here. Consider a directed graph G = (N, A) consisting of a 
set N of nodes and a set A of ordered pairs of nodes called arcs. 
Associated with each node 7 is a value d;, representing the amount of 
supply of a commodity available at the node. A negative “supply” 
represents a demand for the commodity at the node. Each arc (i, /) 
has a lower bound a;; and upper bound 0, on the number of units of 
the commodity that is allowed to flow along the arc from node 7 to 
node j. Also, there is a shipping cost of cj per unit of flow along each 
arc (1, 7). An example network is shown in Fig. 2, with each node i 
labeled by dj, and each arc (i, 7) labeled by (arc no. aj, by, ci). A 
feasible flow x = (xij) assigns flow values xj to arcs (1, 7) which satisfy 
the conservation-of-flow constraints eq. (2) and the capacity require- 
ments eq. (3). An example of feasible flow for the network in Fig. 2 is 
shown in Fig. 3, where each arc is labeled by its flow value. The 


(1,1,10,29) (8,2,10,11) 





(ARC NO., LOWER BOUND, UPPER BOUND, COST) 


Fig. 2—Network. 





eum BASIC ARCS 
——*s» NONBASIC ARCS 


Fig. 3—Feasible flow. 
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objective is to find a feasible flow that minimizes the total cost of eq. 
(1). This problem corresponds to the following Primal Linear Program 
(PLP): Minimize 


 CyKy (1) 
LPC JEA 
subject to 
xyj- Yo xed iEN (2) 
FO NEA RAR DEA 
a;Sxysby (Ls) EA. (3) 


The summations are taken only over existing arcs. This emphasizes 
the fact that sparsity of the network is exploited by only storing data 
for arcs that are present in the network and only performing operations 
on these arcs. 

We do, of course, allow multiple (or parallel) arcs between nodes 1 
and j. These arcs can each be given explicitly with associated costs 
and bounds and would be treated as distinct arcs. In Section IX, we 
describe how the implementation of this algorithm handles parallel 
arcs implicitly. 

It is well known that PLP can be reformulated with either all supply 
values equal to zero or all lower bounds equal to zero. However, since 
some problems are formulated naturally in one way and some in the 
other way, we allow both the supply and lower-bound values to be 
nonzero. 

We assume that the network is connected. If not, the problem 
decomposes into independent subproblems on the connected compo- 
nents. We make this assumption merely to simplify the discussion. In 
fact, the algorithm described here converts the given network into a 
connected one by appending a dummy node and dummy arcs as part 
of obtaining an initial feasible solution. 


lll. PRIMAL NETWORK ALGORITHM 


The primal-simplex algorithm of linear programming was specialized 
to the transshipment problem by Dantzig*. It leads to the “stepping- 
stone” approach as described by Charnes and Cooper.’ This approach 
was generally believed to be computationally inferior to the existing 
primal-dual methods. However, recent work on primal network ap- 
proaches has taken advantage of data structures to streamline the 
calculations involved and reduce the storage requirement. In this 
section, we outline the major facts related to the application of the 
primal-simplex approach to network problems. A more complete anal- 
ysis is provided by Bradley et al."* 
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3.1 Basic solutions 


The primal network algorithm is a specialization of the bounded- 
variable, simplex method. This allows the capacity constraints eq. (3) 
to be handled implicitly in the computations. The remaining con- 
straints eq. (2) can be written in matrix notation as Ax = d, where A 
is the node-arc incidence matrix for the network and d is the vector of 
supply values. A solution x to eq. (2) is called basic if the columns of 
A associated with the variables x, which are strictly between their 
bounds, are linearly independent. The value of the basic variables are 
uniquely determined once the nonbasic variables have been specified 
to be equal to their lower or upper bounds. Basic solutions correspond 
to the bases of the column space of A. Also, the feasible basic solutions 
are exactly the extreme points of the polyhedron described by eas. (2) 
and (3). Hence, for any (linear) objective function eq. (1), some basic 
solution is optimal. 

Each column of A corresponds to an arc in the network. Therefore, 
since the network is connected, the basic solutions correspond to 
subsets of arcs that form spanning trees in the network; the remaining 
arcs have flow values that are restricted to their lower or upper bound. 
The darker arcs in Fig. 3 form a basis tree. 


3.2 Optimality criteria 


Assign dual variables (or node potentials) u; to each constraint in 
eq. (2), and variables uj; and w; to each lower- and upper-bound 
constraint, respectively, in eq. (3). The Dual Linear Program (DLP) for 
PLP is defined as follows: Maximize 


yy duit Y ayvy- Y bywy (4) 
iEN (i,jJEA (i, EA 
subject to 
Uj — Uj + UZ — Wi S Ci (,j)EA (5) 
Vi, Wi = O (i,j) EA. (6) 


Using linear programming duality theory, the complementary slack- 
ness conditions that optimal solutions to PLP and DLP must satisfy are 
given by 


Xij(Ciy — Ui + Uj — Vy + Wy) =O, 0 

and 
vii(xiy — ay) = 0, e) 
wy(by ~ xy) = 0. 


By defining the reduced cost of an arc (i, 7) to be 
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Ci = Cj — Ui + U;, (10) 


the optimality conditions for a pair of primal and dual solutions 
become 


if ay<xj<6b,; then cy =0, (11) 
if x;=a; then cy=0, (12) 

and 
if xy=b,; then cy=0. (13) 


3.3 Basis exchange 


Consider a basic feasible solution x* for PLP. The node potentials 
u* are determined using eq. (11) for the basic arcs; these values are 
uniquely determined once any one node potential value is fixed. If eqs. 
(12) and (18) are satisfied, then the optimality of the solutions x* and 
u* follows from their feasibility and complementary slackness. Other- 
wise, an entering (nonbasic) arc and a leaving (basic) arc are identified 
and exchanged to obtain a new basic solution. This process is repeated 
until optimality is reached. 

A nonbasic arc that violates eq. (12) or (13) is selected to enter the 
basis; that is, arc (i, 7) is chosen if 


xyj=ay and cy<0, or 
xy=by; and cy>0. 


The entering arc forms a unique cycle (or loop) when added to the 
basis tree. By interpreting wu; — u; to be the cost of sending one unit of 
flow from node 7 to node i, using arcs only in the basis tree, ci < 0 
implies that increasing the flow along arc (i, 7) and sending this flow 
around the loop from j back to i will strictly decrease the cost. [This 
is true also when decreasing the flow on arc (i, 7) when ¢; > 0.] Since 
this conserves the flow at each node, the amount of flow change on the 
loop is limited only by the capacity constraints on the arcs of the loop. 
One of the arcs whose flow reaches its bound first is selected to leave 
the basis. For example, increasing the flow on the nonbasic arc (4, 7) 
in Fig. 3 increases the flow on arc (3, 4) and decreases the flow on arcs 
(3, 6) and (6, 7). Arc (3, 6) reaches its bound first. Exchanging arcs 
(4, 7) and (3, 6) forms the new basis tree shown in Fig. 4. 


IV. DATA STRUCTURES 


The contemporary work on network algorithms has focused on using 
data structures to improve the effectiveness of primal network codes. 
Since network problems are typically sparse, with many node pairs 
having no arcs between them, we only store the data and perform 
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Table |—Data for the network of Fig. 2 


Arc 1 2 3 4 5 6 7 8 9 10 Ii 12 13 
Lower Bound 1 4 0 0 0 0 90 2 0 0 0 0 1 
Upper Bound 10 10 10 2 10 10 10 10 «10 1 10 10 2 
Cost 29 15 14 138 17 #15 10 11 #20 21 11 25 #18 


calculations for the arcs actually present in the network. The arcs are 
stored in a list, with all of the arcs pointing out of a node grouped 
together. The costs and bounds are kept in similar arc-length lists. 
This is illustrated in Table I for the network in Fig. 2. 

The representation of a basis as a tree results in a compact format 
for storing the relevant data. This format is shown in Table II for the 
basis tree in Fig. 4. A basis tree is specified by identifying one node to 
be the root node and indicating a predecessor NPRED(z) for each 
node 1, which is the next node after z on the unique path from 7 to the 
root node in the tree. The physical direction of the arc between i and 
NPRED(i) is distinguished by setting NDIRECT(z) equal to 1 if the 
arc points from i to NPRED(z) and to —1 otherwise. The variable 
DUALV(i) is the dual variable associated with each node i. The 
amount of flow on the arc between i and NPRED(i) is given by 
FLOW(z). 

The loop formed by an entering nonbasic arc (i, 7) and the basis tree 
can be traced, in one pass, by following the predecessor paths from i 
and j back towards the root until these paths meet. The depth 
NDEPTH(z) of a node 7 is the number of nodes on the path from i to 
the root. The predecessor and depth arrays are used to proceed up 








a tHE THREAD 


7 
N67 


Fig. 4—Basis tree. 


956 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1982 


Table lla—Data structures for the basis tree of Fig. 4—node data 


Node 1 2 3 4 5 6 7 8 
NPRED 0 3 1 3 3 7 4 5 
NDIRECT 0 1 -1 —l1 —1 1 -1 —l1 
DUALV 0 1 15 32 25 18 43 36 
FLOW 0 3 5 2 3 1 3 1 
NDEPTH 1 3 2 3 3 5 4 4 
LTHRED 3 5 2 7 8 0 6 4 


Table Ilb—Data structure for the basis tree of Fig. 4—arc data 
Arc L- 2 3 4 65 6 7 8 9 10 11 12 18 
KASE 0 -1 -1 1 #-1 O -1 -1 0 1 =--1 #-1 «0 


these paths, staying at the same depth on each path so that the search 
ends at the node where the paths meet, thus forming a loop. 

After the entering and leaving arcs have been determined, it is 
necessary to update the basis tree. This involves traversing all of the 
successors of a node to update their dual variable and depth values. A 
useful tool for doing this is the “thread.” The thread LTHRED(z) of 
a node i labels the nodes of the tree in “preorder” according to the 
following recursive rule: First label the root. Then label the nodes in 
each of the subtrees belonging to the immediate successors of the root 
in preorder. (The order in which these subtrees are visited is immate- 
rial.) The thread is illustrated by dotted arcs in Fig. 4. 

A final data structure keeps track of the status of each arc. The 
array KASE(LN) is set to 0 if arc LN is currently nonbasic at its lower 
bound, 1 if it is nonbasic at its upper bound, and —1 if it is basic. 

These data structures are illustrated in Tables IIa and Ib for the 
basic solution in Fig. 4, with node 1 as the root. 


V. INITIAL SOLUTION 


The primal network-flow algorithm proceeds from one basic solution 
to another until an optimal solution is found. It is necessary to provide 
an initial feasible solution to begin this iterative process. Several 
methods are available for obtaining initial feasible solutions.'* We use 
the “Big-M” method of artificially creating a feasible flow by adding 
a dummy node and dummy arcs. This method is easy to implement 
and understand, and works well.’ 

The method begins by initialing the flow in all arcs to be at their 
lower bounds, that is, all arcs are nonbasic at their lower bound and 
the KASE array is set to zero. This induces a net supply at each node 
i with value 

SPLYNT(i)=di- YY ay+ YY an. 


J, JEA kik, DEA 
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For the network in Fig. 2, the values are shown in Table III. A feasible 
flow is formed by adding a dummy node as the root of the tree and 
dummy arcs between the dummy node and all the other nodes. These 
dummy arcs form the basis tree. The direction of the dummy arc in 
the tree is towards the root if the net supply of the node is nonnegative, 
and away from the root otherwise. The amount of flow is equal to the 
absolute value of the net supply of the node. The cost of the dummy 
arcs is set to be very large if the dummy arc points away from the root 
and is set to zero otherwise. The dummy arcs have a lower bound of 
zero and an upper bound of “infinity.” The dual values are calculated 
by setting the dual variable for the root to zero and solving for the 
others using eq. (11). The initial basis tree for the example network in 
Fig. 2 is shown in Fig. 5 with basis array data in Table IV. Later stages 
of the algorithm serve to remove the dummy arcs from the basis. If 
the algorithm terminates with positive flow in a dummy arc, then 
there is no feasible solution to the original problem. 


Vi. ENTERING ARC 


Suitable candidates for arcs to enter the basis are nonbasic arcs 
(i, 7), which are either at their lower bound with negative reduced cost, 
or at their upper bound with positive reduced cost, i.e., eq. (12) or eq. 
(13) is violated. Many strategies have been proposed and tested for 


Table IIl—Values of the net supply for the network in Fig. 2 
Node 1 2 3 4 5 6 7 8 
SPLYNT 1 5 1 —1 —3 0 —2 —1 


ZAIN 


Fig. 5—Initial basis tree. 


Table IV—Initial basis tree 


Node 1 2 3 4 5 6 7 8 9 
NPRED 9 9 9 9 9 9 9 9 0 
NDIRECT 1 1 1 —l —1 1 —1 -1 0 
DUALV 0 0 0 fe fo 0 oo oo 0 
FLOW 1 5 1 1 3 0 2 1 0 
NDEPTH 2 2 2 2 2 2 2 2 1 
LTHRED 2 3 4 5 6 7 8 0 1 
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selecting an arc to enter the basis; these range from choosing the first 
violating arc to choosing the most violating arc. We use a candidate 
stack approach’*’® which allows flexible strategies in controlling the 
choice of an incoming arc. 

The usage of the candidate stack is illustrated in Fig. 6. The 
candidate stack is replenished during a major iteration whenever it is 
empty or when an upper bound MXITER on the number of consecu- 
tive minor iterations is reached. The stack is filled from the arc list 
which has all of the arcs out of a node grouped together. The reduced 
costs are computed for the arcs out of a node i, and the nonbasic arc 
(i, 7), with the largest violation (if any) among these, is placed on the 
stack. This is repeated for nodes z + 1, i + 2, etc., until either all nodes 
have been processed or the stack has reached its maximum allowable 
size of MXSTAK. The next major iteration begins with the node 
where the last major iteration left off. This periodic examination of all 
arcs helps prevent “stalling,” which is a long sequence of pivot ex- 
changes that does not improve the cost of the solution.’ 

Once the stack has been formed in a major iteration, minor iterations 
are performed to choose the nonbasic arcs from the stack with the 
largest violation to enter the basis. At each minor iteration, the arcs 
on the stack are checked for violations. If an arc no longer has a 
violation, it is removed from the stack. Once the stack becomes empty, 
or MXITER consecutive minor iterations are performed, the stack is 
rebuilt in a major iteration. 


Vil. LEAVING ARC 


A nonbasic arc to enter the basis tree is chosen because it violates 
eq. (12) or (13). This implies that moving its flow value away from its 
bound and updating the flow values of the arcs on the newly formed 
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Fig. 6—Candidate stack. 
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loop strictly decreases the cost of the current solution. The flow is 
changed as much as possible until one of the arcs on the loop reaches 
its bound and is chosen to leave the basic solution. This is accomplished 
in one pass by tracing the loop using the predecessor and depth arrays. 

To illustrate this, consider the basis tree in Fig. 4 with arc (6, 8) 
chosen as the entering arc. Arc (6, 8) can have its flow increased by 
one unit before reaching its upper bound. This induces a supply at 
node 8 and a demand at node 6. The path on the side of the loop with 
the induced supply is called the plus path, and the other side is called 
the negative path. The predecessor array is used to trace the paths 
from 6 and 8 towards the root, until the loop is formed where the paths 
join at node 3. Along the way, the amount of change in the flow value 
before an arc reaches its bounds is computed for each arc on the loop. 
The depth array is used to alternatively move up each of the two paths 
checking for a common node. The arcs in Fig. 4 are examined in the 
order (6, 7), (5, 8), (4, 7), (8, 5), and (8, 4); their maximum amount of 
change in flow before reaching a bound is 1, 1, 7, 7, and 8, respectively. 
Hence, arcs (6, 7), (5, 8), and the entering arc (6, 8) all reach their 
bound after a flow change of one unit. Any of these arcs could be 
chosen to leave the basis. 

In our algorithm, the leaving arc is chosen by using Cunningham’s 
rule’ which is designed to avoid “cycling” caused by degeneracy. This 
approach hinges upon maintaining basis trees that are “strongly” 
feasible; that is, the arcs in the basis tree directed towards the root are 
required to have flow values strictly less than their upper bounds, and 
arcs directed away from the root have flow values strictly greater than 
their lower bounds. This assures us that a positive amount of flow can 
always be pushed up the tree. We note that the initial solution 
procedure always constructs a strongly feasible tree and that the tree 
in Fig. 3 is strongly feasible. 

In the case of a tie among several arcs as candidates to leave the 
basis, the choice is made using the following priority: First, choose the 
candidate arc which is closest to the root on the plus path (if any). 
Next, choose the entering arc, if it is a candidate. Finally, choose the 
arc on the negative path furthest from the root. In the example cited 
above, arc (6, 7) would be chosen. This rule provides the only choice 
which will result in the new basis tree remaining strongly feasible. 


Vill. BASIS EXCHANGE 


Exchanging the entering and leaving arcs requires updating the 
basic solution. For example, choosing arc (1, 4) to enter the basis tree 
in Fig. 4 results in the choice of arc (1, 3) to leave the basis. This forms 
a new basis tree as illustrated in Fig. 7. This changes the data arrays 
associated with the basic solution. Specifically, the nodes on the loop 
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(b) 
Fig. 7—Basis exchange. (a) Old basis tree. (b) New basis tree. 


must be traced (as described before) to update the predecessor, direc- 
tion, flow, and thread array (NPRED, NDIRECT, FLOW, and 
LTHRED) values. In the example, nodes 1, 3, and 4 are on the loop. 
In addition, the dual variable and depth array (DUALV and 
NDEPTH) values must be updated for the nodes in the subtree which 
is moved because of the basis exchange. In our example, the subtree 
moved is rooted at node 3 and consists of nodes 2, 3, 4, 5, 6, 7, and 8. 
The thread array LTHRED is used to identify the nodes in the subtree 
rooted at a node 7 as follows: 
(t) Label the root node i to indicate that it is in the subtree. Let 

J = LTHRED (i). 

(ui) If PRED(,) is labeled, then label j, and set 7 = LTHRED(,), or 
stop with all nodes in the subtree labeled as such. 

This procedure works because the nodes in the subtree appear 
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consecutively on the thread immediately after the root, and because 
the predecessor of a node always appears on the thread before the 
node itself does. The method used to update the basic solution values 
is an enhancement of a procedure described in detail by Jacobsen.” 
The change in status of the entering and leaving arcs is reflected by 
updating their values in the KASE array. The new array values for 
the basic solution of Fig. 7 are shown in Table V. The optimality of 
this solution may be checked by noting that the flow and dual variables 
are feasible to PLP and DLP, respectively, and the complementary 
slackness conditions eqs. (11), (12), and (18) hold. 


IX. EXTENSION TO OTHER COSTS 


We have described an algorithm for finding a minimum cost flow in 
a capacitated network with linear costs. The current implementation 
actually handles piecewise-linear, convex costs or, equivalently, mul- 
tiple parallel arcs between two nodes. We describe how this is accom- 
plished implicitly without any additional work required. 

Consider the piecewise-linear, convex arc cost shown in Fig. 8. It 
consists of three line segments with cost slopes equal to c), c2, and cs, 
respectively. The lower bound on the allowable flow is bo, with three 
break points of bi, be, and b3. This is equivalent to having three parallel 


Table V—Data structure for the new basis tree of Fig. 7 


Node 1 2 3 4 5 6 7 8 
NPRED 0 3 4 1 3 7 4 5 
NDIRECT 0 1 1 ~1 —1 1 -1 -1 
DUALV 0 —2 12 29 22 15 40 33 
FLOW 0 3 1 2 3 1 3 1 
NDEPTH 1 4 3 2 4 4 3 5 
LTHRED 4 5 2 3 8 0 6 7 

Arc Data 
Arc J 2 3 4 5 6 7 8 9 10 Il 12 13 
KASE -1 0 -1 1 -1 0 -1 -1 =O 1 -1l -1 0 


(3, 0, 63-b2, ¢3) 


(2,0,62-64,¢2) 
Mi sail 


(1,69,4,C4) 
(d) 


Fig. 8—Piecewise-linear, convex arc cost and equivalent parallel arc formulation. 
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arcs (one for each line segment) as shown in Fig. 8, with arcs labeled 
by arc no., lower bound, upper bound, and cost. 

This situation can be handled implicitly by expanding the definition 
of the status of an arc LN as follows: 


KASE(LN) 


ee k_ if arc LN is nonbasic with flow value b; 
~—k if arc LN is basic with flow value between bz-; and 6; ° 


For this case, convexity and separability guarantees that at most one 
arc of each parallel arc group will be basic; therefore, we can now 
proceed with the algorithm as if only one arc exists between the nodes. 
The appropriate cost and bound values to use for this arc depend only 
on the status of the arc. 

This feature is also useful for problems with convex arc costs. We 
can always approximate a convex cost function by a piecewise-linear, 
convex cost function as shown in Fig. 9. This can be used to obtain 
approximate solutions. Meyer’s alternative method” approximates the 
convex cost by a piecewise-linear, convex cost, with only two line 
segments whose breakpoints vary from one iteration to the next. 
Meyer’s approach is guaranteed to converge to the optimal solution. 

Concave arc costs model economies of scale and are discussed by 
Erickson et al.,"° where a dynamic programming solution procedure is 
proposed. An alternative method of solution is to use a branch-and- 
bound approach, solving the subproblems using a linear approximation 
to the concave cost. Such an approach has been applied successfully 
for fixed-charge transportation problems by Barr et al.” 


X. APPLICATIONS 


Numerous applications of network-flow models can be found in the 
references provided. We briefly describe three applications in com- 
munication systems. 


COST 





bo by b2 b3 ba FLOW 


Fig. 9—Piecewise-linear approximation of a convex cost function. 
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Our first example illustrates a situation where piecewise-linear, 
convex arc costs arise in a natural way. Hammer and Segal”! formu- 
lated a model for handling “area transfers” in a local area. The area is 
subdivided into districts j which are served by wire centers 7. Currently, 
w, working lines (or wire pairs) are in use between 7 and /, with 
additional s; spare lines unused. We let 


dj = y, wi 


represent the current demand at district 7. Future demand for lines at 
district 7 can be satisfied from wire center i by using the spare lines at 
a cost of A; per line or adding additional lines beyond the available 
spare at a larger cost of Aj + 6,. Alternatively, an area transfer may 
occur if a wire center 7’s capacity b; on the number of lines it can serve 
is exceeded. This involves disconnecting a line from wire center 1 to 
district 7 at a cost of ui per line and reconnecting it to another wire 
center. Given new demands d; + A; for lines at each district 7, the 
problem of assigning customers to wire centers (with possible area 
transfers) at minimum total cost is a transportation problem with a 
piecewise-linear, cost structure for each 1, 7 pair as shown in Fig. 10. 
The slope of the three line segments and the break points are indicated 
on the figure. Denote by x; the change in the line assignments between 
wire centers i and districts 7, and by c;(x) the separable, piecewise- 
linear cost functions of Fig. 10. Then, the problem is to minimize 


yd C(x) (14) 
tg : 
subject to 


Y (wy + xy) = a+ Aj Vj 
i (15) 


» (wi + xij) =< b; Vie 
J 






COST 


a cs a 20 


Wij Wij Sij NUMBER OF WIRES 
WORKING WORKING 
WIRES + SPARE 
WIRES 


Fig. 10—Piecewise-linear, convex cost structure for the area transfer model. 
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Wy S Xi Vij. (16) 


In general, this formulation fits transportation models in which convex 
costs are incurred whenever one perturbs (increases or decreases) the 
established pattern of the distribution (w,). A project scheduling 
problem is formulated by Lawler’ as a network-flow problem with two 
piecewise-linear, convex cost segments. 

A second example involves scheduling the collection of call records 
in private communication networks and is discussed in detail by 
Monma and Segal.” A private communication network consists of tie 
trunks interconnecting various locations. Calls between two locations 
overflow to the toll network when all connecting tie trunks are busy. 
Each location is served by a Private Branch Exchange (PBx) which 
generates a call record for each call originating at its location. 

A central PBx is equipped with several ports, each capable of 
simultaneously collecting records from different locations. According 
to a prescribed schedule, the central PBx initiates calls and begins 
transferring call records. This continues until all records stored at the 
start of the polling have been collected. 

We describe a network-flow model used to find a good polling 
schedule by illustrating it in Fig. 11 for two offices (PBxs) and 48 
polling periods in the day. Each period J is represented by a node J on 
the right side of the figure. Each office J is represented by a collection 
of 48 nodes arranged in a particular configuration as shown on the left 
side of the figure. Each node J in the configuration for office I 
represents the period J at the office. The arc directed horizontally into 
this node J indicates that a known number of records are generated 
during period J at office I. The arc directed from a period J to a 
following period J + 1 node signifies records in storage at the end of 
period J which are carried over to the start of period J + 1. We note 
that the records left in storage at the end of the last period in the day 
carry over to the first period of the following day. 

The arc directed from the period J node of office J to the period J 
node on the right side of the figure indicates the polling of records 
from office J at the start of period J. The arc directed from the period 
J node to the sink node represents all of the records polled in period 
J among all offices. The arrrow out of the sink node means that all 
records generated must be polled at some time during the day. 

Capacities on the arcs depend on the amount of call-record storage 
space available at each office and the polling capacity of the ports at 
the central psx. The cost per record for polling office J during period 
J depends on the probability that a call from the central PBx to office 
I will overflow to the toll network and the cost of such a call. This 
network-flow model is used in an interactive way to produce polling 
schedules (which are determined from the flows on the arcs from 
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PERIOD 
NODES 


Fig. 11—Network-flow model for polling call records with two offices and 48 periods. 


period J office J nodes to period J nodes) of minimum cost which 
avoid losing call records. (Call records may be lost if a PBX generates 
them faster than they are polled, thus, overflowing the storage capacity 
of the PBX.) 

Our final example involves a capacity-expansion problem in the local 
network, which connects subscribers to the local switching office. 
Elken” discusses a mathematical programming formulation of this 
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Fig. 12—Network-flow models for placement of pair-gain devices. 


problem. We will describe a network-flow model by way of a stylized 
and simplified example. The network is composed of cable sections 
which connect pairs of nodes (see Fig. 12). The nodes represent 
demand points for groups of subscribers. The solid arcs represent cable 
sections. Each cable section may consist of several parallel existing or 
proposed cables of various sizes (and costs). The dashed arcs represent 
the possible placement of pair-gain systems (in which a small number 
of wire pairs are used to serve a larger number of customers by means 
of multiplexing or concentration). The piecewise-linear, convex costs 
arise from the multiple parallel cables in each cable section. A solution 
to the problem represents a least-cost expansion of cables and place- 
ment of pair-gain systems. 

Finally, we mention some computational experience with the PNFC 
computer code. Elken™ solved several examples of the previously 
described local network problem with about 50 nodes, 150 arcs, and 2 
piecewise-linear sections. The average CPU time was 1.5 seconds. The 
scheduling of the call-records problem was solved hundreds of times 
for problems with about 500 nodes and 1000 arcs and required an 
average of 3.5 seconds. The largest example tested was a 400-node 
assignment problem with 40,000 arcs; this problem was solved in about 
80 seconds of cPU time on the IBM Amdahl computer. 
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Improved Reconstruction of DPCM-Coded 
Pictures 
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A new scheme for the reconstruction of DPCM-coded signals is 
presented. In this scheme, instead of assigning one representative 
level to the prediction error whenever it is in a range determined by 
the decision levels of a DPCM quantizer, surrounding local picture 
structure is used to improve the reconstruction. No extra transmission 
is required. Mean-absolute-reconstruction error is decreased by over 
20 percent, and the picture quality is significantly improved in flat, 
as well as high, detailed areas of still pictures and sequences of 
pictures containing motion. 


I. INTRODUCTION 


In most picture communication systems that employ any type of 
quantized encoding, samples are sequentially encoded, transmitted by 
the channel and reproduced at the receiver. Pulse-Code Modulation 
(pcm) systems assign one of the many quantization levels to the 
amplitude of each sample, whereas differential pulse-code modulation 
(DPCM) systems assign a quantizer level to the prediction error of each 
sample.’ Thus, in pcM systems, no use of the correlation between the 
adjacent picture samples is made either at the transmitter for coding 
or at the receivers for display. In ppc, the “previously” coded sample 
values are used for computing the prediction of the “present”? sample 
at the transmitter, and thus correlation of the present element with 
the previous pels is utilized. However, no use of the “future” samples 
is made at the transmitter or at the receiver. 

In this paper, we present techniques for coding and display that use 
correlation between the present sample, and its past and future neigh- 
bors. At the transmitter, this improves the prediction; and at the 
receiver, this results in a more accurate reproduction of the intensity 
for display. The technique can be used at the receiver, or at the 
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transmitter, or both. For example, its use only at the receiver amounts 
to postprocessing of the coded data received from a standard DPCM 
transmitter. Each sample that is DPCM coded with a coarse quantizer 
is known at the receiver to be in a range that is determined by the 
prediction and the coarseness of the quantizer. Knowing the ranges of 
surrounding neighboring samples, a new representative value can be 
assigned to each sample. 

We describe a technique for such an assignment and give results of 
computer simulations on still pictures, as well as a sequence of frames 
containing motion. Simulations show that in areas containing high- 
spatial correlation, improvement using our technique is the greatest. 
In pPcM systems, this corresponds to samples whose prediction error 
occupies the “inner” (close to zero) levels of the quantizer. Coarseness 
of the quantizer in these areas shows as granular noise, which is 
reduced by our technique. However, this reduction occurs without the 
blurring of high-detailed areas associated, for example, with low-pass 
filtering. Comparison of the quality of the pictures shows that edge 
busyness and slope overload are also reduced, but this improvement is 
not reflected proportionately in the mean-square-reconstruction error. 


ll. ALGORITHM 


In the basic DPcM system, a prediction of the present sample (e.g., 
sample X in Fig. 1) is made from the previously encoded information 
that is already transmitted to the receiver. Thus, in Fig. 1, the intensity 
of pel X can be predicted by A, or (A + D)/2, where A and D are the 
reconstructed values of samples A and D. The error resulting from the 
subtraction of the prediction from the actual value of pel X is quantized 
into a set of discrete amplitude levels. Thus, a quantized form of 
(X — A) may be sent if the so-called previous element predictor is used. 
The quantizer levels are represented as binary words of either fixed or 
variable length and sent to the channel for transmission. If the predic- 
tion error, e, falls within the two consecutive decision levels e,, and és, 
then the receiver knows the intensity of the sample to be within the 


Fig. 1—Pel configuration for intraframe processing. 
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range [(P + e;), (P + e2) |, where P is the prediction value. Traditionally, 
a representative level e,, is chosen to represent e in the range [e,e2] 
and the reconstruction at the receiver is taken to be (P + e,). Such a 
reconstruction is based on the representative level e,, which is chosen 
to minimize certain average error.”** However, pictures are highly 
nonhomogeneous and nonstationary; therefore, benefit can be derived 
by choosing a representative level based on the local characteristics of 
the picture. However, if no additional information is transmitted to 
the receiver, the local characteristics have to be estimated based on 
the quantized information. The problem then is to estimate the inten- 
sity of pel X, 
(t) knowing that it is in the range [P + e:, P + e2] and 

(ti) knowing the ranges of the surrounding correlated elements, 
such as A, M, F, E, D, P, O, N, etc. 
Since elements A, M, ---, N are correlated with element X, their 
range information should give a better estimate of the pel intensity at 
X. Since an accurate model for the picture signal does not yet exist, we 
could not apply estimation theory. Instead, we tried several intuitively 
reasonable techniques, and the following appears to be the best for the 
set of pictures used. The following steps are used in the reconstruction 
algorithm. 

1. Use a set of predetermined surrounding elements, e.g., A, M, F, 
E, D, P, O, and N. . 

2. Determine the normal (i.e., using a standard DPcM receiver) 
reconstructed value for X, and the surrounding elements. 

3. Knowing p, the prediction of X, determine the range for X, i.e., 
[p + e1, p + ee], in addition to its normal reconstructed value p + e,. 

4. Let n and m be the number of surrounding elements having 
normal reconstructed value less than and more than (p + e,;), respec- 
tively. Then, the new reconstruction for element X is* 


n(p + e:) + m(p +e,) 


X= , if n>m 
(n + m) 

_ n(p + er) + m(p + es) iad a 
(n + m) 

= p + (e1 + €2)/2 if n=m. 


Thus, the new reconstruction is a weighted sum of the endpoints of 
the range, where the weights are determined by the surrounding 
elements. Since the new reconstruction maintains the value of the 


* If n is close to m and changes slightly from sample to sample, the above reconstruc- 
tion may show jumps from sample to sample. This provides a dithering effect that 
appears to improve the picture quality. 
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element X within the range [P + e1, P + e2], the filtering action of the 
weighted averaging process does not blur the edges significantly. It is 
possible to choose the “surrounding” elements that have characteris- 
tics similar to the element X, so that the averaging does not produce 
any noticeable artifact. 

The algorithm described above is primarily for postprocessing at the 
receiver for better reconstruction. It is also possible to use the algo- 
rithm for preprocessing at the transmitter. In traditional DPCM, recon- 
structed values are used for prediction. Therefore, it is possible to use 
only the previously transmitted pels to improve the reconstruction and 
then use this better-reconstructed value for prediction. The constraint 
that only previously transmitted values can be used for improved 
reconstruction limits the improvement when intraframe predictors are 
used. As an example, if previous element A is used for prediction of X, 
then better reconstruction of A can be performed by using only 
surrounding transmitted neighboring pels, such as B, E, F, A, etc. 
Since other neighbors, such as elements X, P, and O cannot be used, 
smaller improvement in reconstruction is obtained. However, in the 
case of frame-to-frame coding, substantial improvement is possible. 
For example, in Fig. 2, previous frame element, X, can be reconstructed 
using all the elements in the 3 X 3 neighborhood, since they would be 
available at the receiver at the time of coding present element X. 


lil. SIMULATION RESULTS 


We performed computer simulations to evaluate the improvements 
using our reconstruction techniques. The still frames used were 
256 X 256 pel arrays with each pel quantized to 8 bits (levels 0 to 255). 
These are shown in Figs. 3 and 4. The sequence of frames containing 
motion, consists of 64 frames called “JUDY.” Four frames from this 
sequence are shown in Fig. 4 of Ref. 5. For simulation with still frames, 
two predictors, A (previous element), (A + D)/2 (planar, two dimen- 
sional) were used. Three symmetric, even-level quantizers were used. 
Their positive decision levels are given below: 


Q@: : 0, 10, 255, 4 levels. 
Q2 : 0, 3, 9, 29, 255, 8 levels. 
Qs; : 0, 10, 25, 43, 255, 8 levels. 


The mean-absolute-reconstruction error (i.e., the magnitude of the 
difference between the original and the reconstructed picture) was 
calculated for both pictures with standard pPcm: and the improved 
DPcoM described in the previous section. As mentioned before, since the 
intraframe coding does not allow significant improvements at the 
transmitter using our techniques, only postprocessing at the receiver 


972 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1982 








SCAN LINES .“~ | 
IN FIELD 2 l 

\ \ 
\ 
\ 





XN 
~~ PREVIOUS FRAME 


\ 
_ SCAN LINES 
7 IN FIELD 1 


~~—= CURRENT FRAME 


Fig. 2—Pel configuration for interframe processing. 


was performed. Three cases were considered, depending on the number 
of neighbors used. Case 1 used three neighbors, A, X, and M; Case 2 
used five neighbors, A, X, M, E, and O; and Case 3 used nine neighbors, 
A, X, M, F, E, D, P, O, and N. Tables I and II show the mean-absolute- 
reconstruction error for various cases. We note that a larger number 
of neighbors improves the performance of our postprocessing tech- 
niques. Large improvements in mean-absolute error are obtained for 
quantizers that have coarse inner levels. Thus, when quantizers Q and 
Qs are used, we get about 20 percent improvement over the standard 
ppcm. Although the mean absolute error does not show improvement 
with quantizer Q>, the quality of the picture is improved considerably 
because of improved reconstruction of the low-contrast areas, as well 
as edges. 

Quantizers for intraframe predictive coders usually contain fine 
inner levels to reduce the visible granular noise in flat areas. If only a 
small number of levels can be used, then one has to trade off granular 
noise for other distortions such as edge busyness. It may be possible 
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with our technique to use a quantizer with coarse inner levels so that 
better edge reproduction is possible, since granular noise would be 
mitigated to a large extent. 

Simulations of our scheme for a sequence of frames involved an 
improved DPCM transmitter, as well as receiver. Thus, referring to Fig. 
2, three types of schemes were simulated. In the first, conditional 
replenishment with previous frame predictor (i.e., X for prediction of 
X) and a moving area threshold of 2 (out of 255) was simulated. This 


Table [—Comparison of mean-absolute error for circuit diagram 


Number of Number of Number of 
Neighbors = 3} Neighbors = 5 | Neighbors = 9 









Predictor Used 













Standard 


F . DPCM 
Quantizer used: Q; Improved 5.84 4.92 
DPCM 
Standard 3.13 2.31 3.13 2.31 
Quantizer used: Q2 DPCM 
Improved 3.18 2.72 3.01 2.40 
DPCM 
Standard 2.92 3.09 2.92 3.09 
: ; DPCM 
Quantizer used: Qs Improved 2.95 3.01 2.07 2.08 
DPCM 


Table Il—Comparison of mean-absolute error for checkered girl 


Number of Number of Number of 
Neighbors = 3 | Neighbors = 5 | Neighbors = 9 


Predictor Used 











Standard 


‘ DPCM 
Quantizer used: @: Improved 5.72 3.78 5.07 3.13 
DPCM 
Standard 296 1.71 2.96 1.71 
DPCM 
Quantizer used: Qe Improved 3.04 1.98 294 1.80 
DPCM 
Standard 2.96 2.79 2.96 2.79 
Quantizer used: Q Be 
al Improved 3.24 2.80 244 2,20 


DPCM 
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was used for comparison. The second scheme was identical to the first, 
except that the reconstruction of X was done by using the 3 x 3 
neighborhood only when X was a moving area pel. If X is not a moving 
area pel, then it is reconstructed by using its previous frame value. 
Thus, only postprocessing at the receiver is used. The third scheme 
used an improved reconstruction value from the previous frame also 
for prediction. All three schemes used quantizer Q3. The results are 
shown in Figs. 5, 6, and 7. Figure 5 shows the mean-absolute-recon- 
struction error averaged over all the pels as a function of the frame 
number. It is easy to see that there is about 20 percent reduction in 
mean-absolute error because of the improved reconstruction (scheme 
2). Also, the improved prediction and reconstruction (i.e., scheme 3) 
reduces the error further by about 10 percent. Figure 6 shows the 
improvement when restricted to moving area pels. Since the number 
of moving area pels differs for the three schemes, total reconstruction 
error is shown. Considerable improvement is now seen by postprocess- 
ing alone and pre- and postprocessing. The total prediction error for 
all the moving area pels for schemes 1 and 3 are plotted in Fig. 7. Thus, 
instead of using X for prediction as in scheme 1, if we improved the 
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Fig. 5—A plot of the mean-absolute-reconstruction error as a function of the frame 
number. 
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predictor using the 3 X 3 surrounding elements around X, the predic- 
tion is improved by over 15 to 20 percent. However, much of this 
improvement is apparently included in only postprocessing with no 
change in predictor (scheme 2). The pictures produced by the three 
schemes were viewed in informal sessions. It was concluded that the 
“dirty window” effect in moving areas associated with coarse quanti- 
zation (scheme 1) was removed to a large extent by improved post- 
processing (scheme 2). In many cases, the blur in areas of high-contrast 
edge motion, resulting from a quantizer with insufficient dynamic 
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Fig. 6—A plot of the total error in the moving area versus the frame number. 
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Fig. 7—A plot of the total absolute-prediction error as a function of the frame number. 


range, was also reduced. Although the results are not shown when 
quantizer @Q2 was used, we found that there was improvement in 
absolute-reconstruction error, and the picture quality showed signif- 
icant improvements. 


IV. SUMMARY AND CONCLUSIONS 


We have presented a new method for reconstruction of DPcM-coded 
samples. Instead of using one representative level whenever the pre- 
diction error is within a range, we use local properties of the picture 
signal to derive an improved reconstruction. No extra transmission is 
required. Reconstruction error is decreased by over 20 percent, and 
picture quality is improved significantly. It is possible to use quantizers 
that have coarse inner levels and to use our techniques to reduce the 
visible granular noise. Therefore, quantizers with fewer levels can be 
used. Improvement is obtained for coding of still frames, as well as 
sequence of frames containing motion. 
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An Improved Training Procedure for 
Connected-Digit Recognition 


By L. R. RABINER, A. BERGH, and J. G. WILPON 


The “conventional” way of obtaining word reference patterns for 
connected-word recognition systems is to use tsolated-word patterns, 
and to rely on the dynamics of the matching algorithm to account for 
the differences in connected speech. Connected-word recognition, 
based on such an approach, tends to become unreliable (high-error 
rates) when the talking rate becomes grossly incommensurate with 
the rate at which the isolated-word training patterns were spoken. 
To alleviate this problem, an improved training procedure for con- 
nected-word (digit) recognition is proposed in which word reference 
patterns from isolated occurrences of the vocabulary words are com- 
bined with word reference patterns extracted from within connected- 
word strings to give a robust, reliable word recognizer over all normal 
speaking rates. The manner in which the embedded-word patterns 
are extracted was carefully studied, and it is shown that the robust 
training procedure of Rabiner and Wilpon can be used to give 
reliable patterns for the embedded, as well as the isolated, patterns. 
In a test of the system (as a speaker-trained, connected-digit recog- 
nizer) with 18 talkers, each speaking 40 different strings (of variable 
length from 2 to 5 digits), median-string error rates of 0 and 2.5 
percent were obtained for deliberately spoken strings and naturally 
spoken strings, respectively, when the string length was known. Using 
only isolated-word training tokens, the comparable error rates were 
10 and 11.3 percent, respectively. 


I. INTRODUCTION 


Recently, several algorithms for recognizing a connected string of 
words based on a set of discrete word reference patterns have been 
proposed.’ Although the details and the implementations of each of 
these algorithms differ substantially, basically they all try to find the 
optimum (smallest distance) concatenation of isolated-word reference 
patterns that best matches the spoken word string. Thus, the success 
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of all these algorithms hinges on how well a connected string of words 
can be matched by concatenating members of a set of isolated-word 
reference patterns. Clearly, for talking rates that are comparable to 
the rate of articulation of isolated words [e.g., on the order of 100 
words per minute (wpm)], these algorithms have the potential of 
performing quite accurately.’* However, when the talking rates of the 
connected-word strings become substantially larger than the articula- 
tion rate for isolated words, for example, around 150 wpm, then all the 
pattern-matching algorithms tend to become unreliable (yield high- 
error rates). The breakdown mechanism is easily understood in such 
cases. The first effect of a higher talking rate is a shortening of the 
duration of the word within the connected-word string. This shortening 
is highly nonlinear (i.e., vowels and steady fricatives are first shortened 
and then they are reduced, whereas sound transitions are substantially 
unaffected) and is not easily compensated by the dynamic time warp- 
ing (DTW) alignment procedure, which has its own inherent local and 
global warping path constraints. The second pronounced effect is the 
sound coarticulation that occurs at the boundaries between words in 
the string. As the talking rate increases, the amount of coarticulation 
increases to the point where the isolated-word reference patterns no 
longer adequately match the words in the sequence, especially at the 
beginning and end of words. 

The above discussion emphasizes the fact that the currently avail- 
able connected-word recognition systems are inadequate for most 
reasonable talking rates; i.e., only a highly motivated talker who spoke 
the connected-word strings in a careful, deliberate manner, would 
achieve high accuracy. The next question then is what can be done to 
increase the reliability of the connected-word recognizer at normal 
talking rates? The answer to this question is the topic of this paper. 
The proposed solution is an improved training procedure in which the 
isolated-word reference patterns are supplemented with word reference 
patterns extracted from connected-word strings. Although there has 
been no publication on the use of embedded word training patterns for 
connected-word recognition, the Nippon Electric Company’s DP-100 
training procedure has, in demonstrations, used embedded digits, 
obtained as the middle digit of three-digit strings, to supplement the 
isolated digits in the training set.”® 

The purpose of this paper is to show how word reference patterns 
for a connected-word recognizer can be obtained in a reliable and 
robust manner from connected-word strings. We also show how such 
embedded training patterns can be combined with standard isolated- 
word reference patterns to provide a training set which is capable of 
recognizing connected-word strings spoken at normal talking rates. In 
particular, we consider recognition of connected-digit strings of from 
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two to five digits. It is shown that in a speaker-trained test, the 
connected-digit recognizer achieved a 97.5 percent median-string ac- 
curacy on normally spoken digit strings using the improved training 
set, whereas an 88.7 percent median-string accuracy was obtained 
using only isolated-digit templates. 

The outline of this paper is as follows. In Section II, we describe the 
improved training procedure and discuss the important issue of how 
to extract an embedded-word reference pattern that is reliable and 
robust. In Section III, we describe and give the results of an experiment 
to independently evaluate the automatic training procedure of Section 
II. In Section IV, we present results of a connected-digit recognition 
test using 18 talkers, each of whom used the improved training pro- 
cedure. Finally, in Section V, we summarize our results, and discuss 
alternative procedures for improving the connected-digit recognition 
algorithm. 


ll. THE IMPROVED CONNECTED-WORD TRAINING PROCEDURE 


To understand the improved connected-word training procedure, we 
must first review briefly our current implementation of the connected- 
word recognizer. Thus, we will focus on the level-building (LB) algo- 
rithm of Myers and Rabiner,°*®” as this is the algorithm for which all 
results will be given. 


2.1 Connected-word recognition using a LB approach 


Assume we are given a test string T = T(m), m = 1, 2, --- , M, where 
T represents a string of words of unknown length. The connected- 
word-recognition problem consists of finding the sequence of reference 
patterns, R* = Raga) ® Rye) @ --- © Raw of arbitrary length L which 
best matches (minimum distance) the test string T. (The operation ® 
is a concatenation operator). The sequence R* defines the L reference 
patterns, Roi), Rgw,..., Riz), which, when concatenated, best matches 
(over all lengths L and over all possible reference patterns) the un- 
known test sequence T. 

The basic procedure for finding R* is to solve a time-alignment 
problem between T and R* using ptw methods. The LB algorithm is 
basically an efficient procedure for solving the associated DTW min- 
imization.”*® Although the details of the exact procedure need not be 
reviewed here, there are several important properties of the LB algo- 
rithm that will be used in this paper. Therefore, we first enumerate 
these properties, which include: 

(t) The LB algorithm finds the optimum matching string of every 
length, L words, from L = 1 to L = LMAX (as set by the user). 

(it) For each string match found by the LB algorithm, a segmen- 
tation of the test string into appropriate matching regions for each 
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word in the string is provided. (We will rely on this property to 
formulate the embedded training procedure). 

(iii) For every string length, L words, a list of the best Q strings 
(i.e., the Q lowest distance L-word strings) can be obtained. 

(tv) By using the flexible parameters of the LB algorithm, namely 
dr, (a range of frames at the beginning of a reference pattern which 
can be skipped), dr, (a range of frames at the end of a reference pattern 
which can be skipped), and denn, (a range of frames at the end of the 
test that can be skipped), modifications of the reference patterns, or, 
equivalently, large discrete jumps in the warping path, can be made to 
partially account for word-boundary and word-shortening effects. 

Figures 1 and 2 summarize the above properties in a fairly concise 
manner. Figure 1 shows a sequence of DTW paths corresponding to the 
best L-word sequence, the best L-1 word sequence, etc. For the best L- 
word sequence, the segmentation points corresponding to the bound- 
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Fig. 1—Sequence of LB DTW warps to provide best word sequences of several different 
lengths. 
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Fig. 2—Summary of flexible LB parameters. 


aries of each of the L words in the test string are also shown. It should 
be clear that the best string of length g need not (and generally won’t) 
contain the same words as the best string of length g + 1. 

Figure 2 shows a summary of how the LB parameters dz,, 5r,, and 
denp can affect the warping path. It shows that the pTw path can 
literally jump over up to (rz, + 5z,) frames of the pattern R* at each 
discrete word boundary within R*. In this manner, a certain degree of 
word coarticulation and word shortening can be directly accounted for 
in obtaining the best matches to the test pattern. It has been shown 
previously®” that judicious use of the LB parameters greatly improves 
the performance of the LB algorithm for several connected-word rec- 
ognition tasks. 


2.2 Isolated-word training for connected-word recognition 


The “standard” reference set for the LB algorithm for speaker- 
trained use is the set of isolated-word patterns obtained via the robust 
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training procedure of Rabiner and Wilpon.’® Thus, for each word in 
the vocabulary, a single-reference token was obtained as the time- 
warped average of two versions of the word which was deemed suffi- 
ciently similar (based on the recognizer-distance measure). 

As discussed previously, the major disadvantage of isolated-word 
reference patterns is that when the talking rate of the test strings 
becomes much higher than the talking rate of the isolated-word 
patterns, reliability and accuracy of the connected-word recognizer fall 
dramatically. To alleviate this problem, we now describe the embed- 
ded-word training procedure. 


2.3 Embedded-word training for connected-word recognition 


The basic idea behind the embedded-word training procedure is that 
an “improved” set of word reference patterns could be obtained by 
combining the isolated-word reference patterns with word reference 
patterns extracted from actual connected-word strings. A second im- 
portant and related problem is deciding from which strings (i.e., which 
sequence of words) such patterns should be extracted. Finally, a third 
related problem is controlling the rate at which the talker speaks the 
strings from which the embedded words are extracted. 

Before describing the training procedure that was studied, it is 
worthwhile discussing the three points made above. The first point is 
perhaps the most fundamental one. In connected-word sequences, it is 
often difficult, if not impossible, to assign boundaries to words in the 
string. For example, in the sequence/one-nine/, the nasal boundary 
between the one and the nine is a shared boundary and can arbitrarily 
be assigned to either digit. As the rate of talking goes up, this problem 
generally is compounded. 

The second problem in implementing an embedded-word training 
procedure, namely, the choice of strings from which to extract the 
patterns, involves deciding whether or not to use strings with heavy 
word coarticulation. (Clearly, this problem is intimately related to the 
first problem.) For example, the string /616/ will provide a markedly 
different embedded digit one from the string /219/. Which of these 
sequences would provide the most useful output, in terms of providing 
an improved reference pattern for the digit 1, is unclear, and is a topic 
of investigation in Section III. 

The third problem, namely, the talking rate of the talker, again 
determines the difficulty in extracting the embedded-reference pattern. 
We will see later in this section that it is reasonable to consider both 
deliberate (i.e., careful articulation of words) and natural (i.e., normal 
articulation of words) talking rates for extraction of reference words, 
and that different analysis procedures should be applied in both cases. 

Based on the above discussion, a block diagram of the embedded- 
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reference word training procedure is given in Fig. 3. The philosophy of 
the training procedure is similar to that of the robust training proce- 
dure for isolated words.’° For each word in the vocabulary, a sequence 
generator (i.e., a talker) produces a string of words in which the desired 
word appears. The choice of word sequences will be discussed below. 
A DTw alignment procedure matches a set of concatenated word 
references, corresponding to the words in the string, to the test string 
T, thereby providing a segmentation of T. The pattern for the appro- 
priate reference word is extracted from T, based on the segmentation, 
and stored in a temporary word store. It is then compared to all 
previous occurrences of that word in the store, using another DTW 
alignment procedure. For each such comparison, a distance score is 
obtained. If any distance score falls below a specified threshold, then 
the pair of tokens giving the minimum distance among all versions in 
the store are averaged, after time alignment, and the resulting refer- 
ence pattern is saved in an embedded-word store. This procedure is 
iterated until an embedded pattern is obtained for each word in the 
vocabulary. 

The only unspecified aspect of the embedded-training procedure is 
the word sequence generator. Since we have tested the improved 
training procedure only on digit-sequences, we will focus our attention 
solely on this vocabulary. We have considered two sequence genera- 
tors—a noncoarticulated (NC) sequence generator, and a coarticulated 
(co) sequence generator. For both cases, the desired digit was the 
middle digit of a 3-digit sequence. The Nc sequences had the property 
that, at the boundary, the preceding and following digits had different 
manners of production than the middle digit. Similarly, the co se- 
quences had the property that either the preceding or the following 
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Fig. 3—Block diagram of embedded-digit training procedure. 
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Table I—Training sequences for embedded digits 
NC Sequences CO Sequences 





digit, or both, had a similar manner of production as the middle digit. 
Table I shows the actual training sequences for both the Nc and co 
cases. A total of 66 sequences* were defined for each case representing 
different environments for each of the digits in strings of a given type. 


ll, EVALUATION OF THE EMBEDDED-WORD TRAINING PROCEDURE 


To study the effectiveness of the embedded-word training procedure, 
four talkers (two male, two female) each trained a connected-digit 
recognizer in the following manner: 

(t) A set of isolated-digit templates was created for each talker 
using the robust training procedure of Rabiner and Wilpon.” A single 
template was created for each of the ten digits. An eleventh template 
was created for the digit 8 in which the talker was requested to speak 
the digit 8 without releasing the final /t/. For the normal 8 template, 
each talker was told to release the final /t/. We designate the isolated- 
digit training set as Is. 

(iz) Four sets of embedded templates were created for each talker, 
using the robust training procedure of Fig. 3. The four sets of templates 
differed in speed at which the talker spoke the three-digit training 
sequences (normal—nr, and deliberate—DL), and the degree of digit 
coarticulation (coarticulated—co, and noncoarticulated—Nc). Thus, 


* For the digit 8, we considered both medial sequences (in which no release of the 
final t was made) and final sequences (in which the final ¢ was asked to be released). For 
final sequences the digit 8 was extracted as the third digit in the string. 
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the four embedded training sets were denoted as CO.DL, CO.NR, NC.DL, 
and NC.NR. 

The composite reference sets used for testing were created by consid- 
ering various combinations of the isolated-digit set with one or more 
of the embedded-digit sets. 

Two test sets (Ts) of testing data were obtained. The first set, 
denoted as TS.DL, consisted of 40 randomly chosen (i.e., a different set 
of 40 strings for each talker) digit strings of varying length from two to 
five digits, each string spoken deliberately, that is, carefully articulated. 
The second set, denoted as TS.NR, consisted of the same 40-digit strings 
for each talker as in TS.DL but, instead, spoken at a normal rate. 

Before presenting the results of the evaluation tests, one point is 
worth noting. Because of the difficulty in accurately finding digit 
boundaries of a digit embedded in a string of digits, the accuracy of 
the entire training procedure is suspect since the test recognition 
scores could be poor because of unreliable embedded-digit boundaries. 
To check this premise, the test strings used in the embedded-digit 
training procedure were processed manually, as well as by the auto- 
matic algorithm. The manual processing consisted of examining energy 
plots of the 3-digit string and listening to synthetic versions of the 
processed digits. This processing was performed iteratively on each 
string until the experimenter found the most acceptable boundaries 
for each digit. At this point, the automatic processing of the training 
procedure took over the rest of the template creation process. 

A simple check on the accuracy of the automatic-word extraction 
process was made by comparing the word boundary frames of the 
automatic procedure with those of the manual procedure. Histograms 
were made for each talker for both the initial and final frames of the 
embedded digit. Typically, the frame error was less than one frame in 
about two-thirds of the cases, with somewhat larger errors for the 
remaining one-third of the cases. Figure 4 shows examples of such 
histograms for one talker. Shown in this figure are the histograms for 
the initial and final frame errors for two of the embedded-training sets. 

A further check on the accuracy of the fully automatic technique 
was obtained by processing, for recognition, both the automatically 
obtained embedded-digit reference sets, and the manually obtained 
embedded-digit reference sets. We now discuss the recognition test 
results. 


3.1 Connected-digits test evaluation of the training sets 
For each of the two test sets (TS.DL and TS.NR) and for each talker, 
the following reference sets were used and tested: 
1. 1s—Isolated digits only, one template per digit, eleven digits. 
2. CO.DL—Embedded digits only, obtained from coarticulated 
strings, deliberately spoken. 
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Fig. 4—Histograms of word boundary error (for one of the talkers) between manually 
and automatically determined boundaries for both (a) and (b) initial and (c) and (d) 
final boundaries for (a) and (c) co.NR and (b) and (d) NC.NR. 


3. NC.DL—Embedded digits only, obtained from noncoarticulated 
strings, deliberately spoken. 

4. 18 ® CO.DL—IS combined with Co.DL. 

5. Is ® NC.DL—IS combined with NC.DL. 

6. Is ® CO.NR—IS combined with Co.NR. 

7. Is ® NC.NR—IS combined with NC.NR. 

(Sets 4 to 7 had two templates per digit, eleven digits) 

8. Is 8 CO.NR © CO.DL. 

9. IS @ CO.NR @ NC.DL. 

10. 18 ® NC.NR © CO.DL. 

11. 18s ® NC.NR ® NC.DL. 

(Sets 8 to 11 contained three templates per digit, eleven digits). 
For reference sets 2 to 11 both automatically and manually obtained 
templates were used. 

Notice that not all single, and double combinations of the embedded 
sets were used. The sets that were omitted (e.g., CO.NR or NC.NR alone) 
were found to have high-error rates in preliminary tests; therefore, 
they were not fully evaluated. 

The results of the recognition tests are given in Tables II and III. 
Table II gives results for Ts.DL (the deliberately spoken strings) and 
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Table II—Recognition results (number of string errors) for different reference sets for TS.DL strings 


Double-Training Sets Triple-Training Sets 
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Talker ®@ ® 
Number CO.DL NC.DL|; CO.DL NC.DL CO.NR NC.NR|CO.DL NC.DL CO.DL 
Automatic endpoints— 1 0 1 0 0 0 1 0 1 
known-length strings 2 4 0 2 2 0 0 1 0 
3 5 4 1 3 3 2 0 2 
4 2 2 2 2 2 2 2 2 
Total 1 7 5 7 5 5 3 5 
Automatic endpoints—best 1 0 2 2 0 1 2 1 3 
string 2 4 0 2 2 0 1 1 0 
3 6 7 5 9 3 7 6 7 
4 9 8 0 8 9 ii 9 8 
Total 9 7 9 9 3 17 17 18 
Manual endpoints—known- 1 2 0 0 1 0 0 1 0 
length strings 2 1 0 0 0 0 0 0 0 
3 5 2 2 3 2 1 1 1 
4 3 2 2 2 2 2 2 2 
Total 1 4 4 6 4 3 4 3 
Manual endpoints—best 1 9 1 1 3 0 2 1 1 
string 2 3 3 3 0 0 2 2 2 
3 8 6 5 1 7 5 4 5 
4 1 7 9 8 0 6 8 7 
Total 1 7 8 2 7 15 15 15 
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Table Ill—Recognition results (number of string errors) for different reference sets for TS.NR strings 


Double-Training Sets Triple-Training Sets 
Single-Training Sets 


CO.NR CO.NR NC.NR 
Talker ®@ @ @ 
Number IS CO.DL NC.DL| CO.DL NC.DL CO.NR NC.NR|CO.DL NC.DL CO.DL 
2 6 2 0 
10 5 5 






































Automatic endpoints— 1 1 1 0 1 0 0 
known-length strings 2 1 3 6 2 1 3 1 

3 7 3 5 2 2 2 4 3 2 2 

4 4 3 5 4 1 3 I 4 1 1 

Total 23 17 17 8 7 1 7 9 6 4 

Automatic endpoints—best 1 5 2 1 1 0 2 0 0 
string 2 7 5 1 3 6 2 1 3 1 

3 6 2 4 6 8 9 7 8 5 

4 3 6 5 2 5 2 6 4 3 

Total 9 8 2 2 0 3 16 15 9 

Manual endpoints—known- 1 3 2 0 0 1 1 0 0 0 
length strings 2 3 3 1 3 2 2 1 3 1 

3 4 5 3 2 4 3 2 2 3 

4 8 6 3 1 2 1 3 1 1 

Total 8 6 7 6 9 7 6 6 5 

Manual endpoints—best I 5 5 1 0 1 1 1 0 0 
string 2 7 3 1 3 3 2 1 3 1 

3 6 7 6 6 2 7 11 10 7 

4 9 7 3 2 4 2 3 3 1 

Total 7 2 1 I ‘0 2 16 16 9 
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Table III is for Ts.NR (normally spoken strings). Results are given in 
this table as the number of string errors (out of a possible 40 strings) 
when evaluated using the LB DTW string recognizer. The results are 
given for the best string, regardless of actual string length, and for the 
best string with string length assumed to be known (KL) a priori. 
Scores are given for each talker, and a sum of total errors is given at 
the bottom of each table. A comparison is also provided, when possible, 
between manually (mM) and automatically (A) embedded template sets. 

The results of Tables II and III show conclusively that neither 
isolated- nor embedded-word templates, alone, are adequate for relia- 
ble recognition of connected-digit strings spoken at either deliberate 
or normal rates. It can be seen that the best overall scores were 
obtained by the triple combination sets with some small variations 
among them depending on the test set, and the talker. The double 
combinations provided scores almost as good as the triple combina- 
tions. 

For the question as to whether to use coarticulated or noncoarticu- 
lated sequences in the training, the results of Tables II and III are 
essentially ambiguous. The scores on the double and triple combina- 
tions favor neither type of sequence over the other for both test sets. 
Similarly, the question as to whether to use deliberately or normally 
spoken strings in the training set is also left unresolved by the data in 
Tables II and III. In the next section, we investigate these issues 
further in a larger test of the overall procedure. 

Perhaps the most significant conclusion from Tables II and III is 
that the error scores from the automatically extracted digit templates 
are essentially identical (to within small errors) to those of the man- 
ually extracted digit templates for all reference sets giving good rec- 
ognition scores. This result indicated that the automatic training 
procedure was a useful one for obtaining embedded-digit templates for 
connected-digit recognition systems. 


IV. LARGE-SCALE EVALUATION OF THE IMPROVED TRAINING 
PROCEDURE 

The results of the preliminary test indicated that the improved 
training procedure led to a reliable set of embedded-digit templates, 
and that when combined with the normal set of isolated-digit tem- 
plates, connected-digit, string-recognition scores greatly improved at 
normal speaking rates over those obtained using isolated-digit tem- 
plates alone. 

To provide a better understanding of the limitations of the improved 
training procedure, a larger evaluation test was performed in which 18 
talkers (nine male, nine female) each used the improved training 
procedure to provide digit reference patterns. Only two sets of embed- 
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ded patterns were extracted, namely, NC.DL and co.NR. The training 
sets studied in the evaluation included: 
(z) All three single trainings sets, namely, Is, NC.DL, and CO.NR, 
each of which consisted of 11 patterns (including the unreleased eight). 
(tt) The two combinations of isolated and embedded references, 
namely, Is ® NC.DL and Is ® Co.NnR. These sets consisted of 22 patterns. 

(iit) The triple combination of 1s ® Nc.DL @ co.NR. This set con- 
sisted of 33 patterns. 

As in the previous test, each talker provided two test sets of 40 
randomly selected connected-digit strings of from two to five digits. 
One set was spoken deliberately (TS.DL), the other set was spoken at 
a normal rate (TS.NR). The digit strings in both sets for the same talker 
were identical. Different talkers spoke different randomly chosen sets 
of digit strings. 

The recognition test consisted of using the LB-based DTw algorithm 
to match the spoken test strings by the best sequence of reference 
patterns, regardless of length. In performing the matches, the param- 
eters of the LB algorithm were set as follows: 

(t) For all references and test sets, the parameters Senp, Mr, and e 
were given values of denp = 4, Mr = 1.6, and e = 20.* 

(ii) The parameters dp, and dz, were made to vary with each subset 

of reference patterns in the following manner. For isolated-digit pat- 
terns in all reference sets, the values dz, = 4 and dr, = 6 were used. For 
NC.DL reference patterns, the values dr, = 2 anddér, = 3 were used. For 
CO.NR reference patterns, the values dz, = dr, = 0 were used. 
The logic for this choice was that the NcC.DL patterns could be shortened 
somewhat, but not as much as the isolated patterns. However, the 
CO.NR patterns could not be shortened at all since they came from 
highly reduced, normally spoken digit sequences. 

Before presenting results of the recognition tests, we first give some 
statistics on each talker’s rate of talking. 


4.1 Talking rate statistics 


The statistics on average talking rate for each of the 18 talkers for 
both Ts.DL and TS.NR strings are given in Table IV, and a summary of 
the overall average rate (as a function of number of digits in the string) 
is given in Fig. 5. The talking rates are given in terms of wpm. It can 
be seen that, for deliberately spoken strings, the average talker’s rate 
varies from 99 to 156 wpm (across the 18 talkers). Thus, a high degree 
of rate variability exists across talkers for deliberately spoken strings 
of digits. For naturally spoken digit strings, the average talking rates 


* These parameter values were chosen based on preliminary runs on connected-digit 
strings, as reported in Ref. 6. 
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Table |V—Average talking rates (wpm) for strings of varying length 
for TS.DL and TS.NR data 


TS.DL 


Number of Digits in String Number of Digits in String 


3 4 5 


Average 





135 


130 


= 
Np 
oa 


RATE (WPM) 
x 
oO 


160 





NUMBER OF DIGITS IN STRING 


Fig. 5—Plot of average talking rate as a function of the number of digits in the string 
for (a) TS.DL and (b) TS.NR data. 


vary from 118 to 193 wpm (across the 18 talkers), again pointing out 
the high degree of variability in normal talking rates. 

However, the plots in Fig. 5 show that when averaged across talkers, 
the talking rate for different length strings does not vary as markedly 
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as for different talkers. For deliberately spoken digit strings, the 
slowest average rate is 126 wpm for two-digit strings, and the average 
rate increases to 134 wpm for four-digit strings. Almost no increase in 
average talking rate is found for five-digit strings over that for four- 
digit strings. For normally spoken digit strings, the same trends of the 
average talking rate are seen across different length strings. Thus, the 
average rate for two-digit strings is 150 wpm and it increases to 170 
wpm for four-digit strings. However, the average rate for five-digit 
normally spoken strings (171 wpm) is essentially the same as for four- 
digit normally spoken strings. 


4.2 Recogniton test results on large data base 


The results of the recognition tests on the 18-talker data base are 
given in Tables V and VI, and are plotted in Fig. 6. Tables V and VI 
give average- and median-string error rates (averaged over talkers and 


— AVERAGE-18 TALKERS 


_— AVERAGE-17 TALKERS 


— AVERAGE-18 TALKERS 


STRING ERROR RATE IN PERCENTAGE 


oo AVERAGE-17 TALKERS 


~ AVERAGE-16 TALKERS 





NUMBER OF BEST CANDIDATES 


-Fig. 6—Plots of string error rates versus number of best candidates for (a) TS.DL and 
(b) TS.NR data using reference set Is ® NC.DL ® CO.NR. 
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Table V—Average- and median-string error rates (percentage 
errors) for TS.DL data for the three reference sets 
TS.DL 





Number of Best Candidates 








Reference Set Statistic KL 1 2 3 4 5 
Is Median 10.0 16.3 8.8 6.3 5.0 5.0 
18 Talkers 11.4 20.0 11.8 9.9 8.5 8.3 
17 Talkers 9.1 17.6 9.3 7.8 6.6 6.5 
16 Talkers 7.5 16.3 8.3 6.4 5.5 5.3 
Is Median 2.5 10.0 2.5 2.5 2.5 2.5 
® 18 Talkers 5.7 14.1 7.4 5.3 5.0 4.9 
NC.DL 17 Talkers 3.1 12.2 4.9 3.1 20 2.8 
16 Talkers 2.0 10.5 3.4 2.0 1.9 1.9 
IS Median 0 5.0 2.5 0 0 0 
® 18 Talkers 3.3 9.7 4.9 3.3 2.6 2.5 
NC.DL 17 Talkers 1.3 7.2 2.8 1.2 0.6 0.4 
® 16 Talkers Ll 5.8 1.9 0.5 0.3 0.2 
CO.NR 


Table VI—Average- and median-string error rates (percentage 
errors) for TS.NR data for the three reference sets 








TS.NR 
Number of Best Candidates 
Reference 
Set Statistic KL 1 2 3 4 5 
IS Median 11.3 13.8 10.0 7.5 7.5 7.5 
18 Talkers 16.3 19.0 11.7 10.3 10.1 9.7 
17 Talkers 13.1 15.9 9.3 8.1 7.9 7.5 
16 Talkers 13.0 15.5 9.1 8.0 7.8 7.3 
Median 5.0 7.5 3.8 3.8 3.8 3.8 
IS 18 Talkers 8.3 11.5 6.4 5.8 5.4 5.3 
® 17 Talkers 5.3 8.4 4.0 3.7 3.2 3.1 
NC.DL 16 Talkers 4.8 7.8 3.3 3.1 2.7 2.7 
Is 
® Median 2.5 2.5 1.3 0 0 0 
NC.DL 18 Talkers 5.8 8.1 3.8 2.9 2.6 2.5 
® 17 Talkers 3.7 5.4 2.1 1.5 1.5 1.3 
CO.NR 16 Talkers 3.4 4.5 1.6 1.1 1.1 1.1 


various length strings) for three of the reference sets studied* (is, Is 
@® NC.DL, IS ® NC.DL ® CO.NR) for TS.DL data (Table V) and Ts.NR data 
(Table VI). Included in the table are string error rates based on the 
top 1, 2, 3, 4, and 5 candidates (regardless of string length) and for the 
case in which the string length was known a priori (KL); that is, we 
only considered strings of the proper length. Results are given for the 
median error rate, the average error rate for all 18 talkers, the average 
error rate for 17 of the 18 talkers (omitting the one with the highest 


* These three reference sets were chosen as they gave the best recognition rates and 
were representative of previously used training methods. 
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error rate), and the average error rate for 16 of the 18 talkers (omitting 
the two talkers with the highest error rates). 

Figure 6 shows plots of the average and median error rates as a 
function of the top n candidates (n = 1, 2, 3, 4, 5) for both Ts.DL and 
TS.NR data using the reference set Is ® NC.DL © Co.NR, which provided 
the best overall results. 

The results of Tables IV and V and Fig. 6 show the following: 

(t) For deliberately spoken strings (TS.DL), the median-string error 
rate is 5 percent on the top candidate and falls to 2.5 percent on the 
top two candidates for reference set Is ® NC.DL ® co.NR. Using 
reference set Is alone, the median error rate is 16.3 percent on the top 
candidate and is still 8.8 percent on the top two candidates. Using 
reference set IS ® NC.DL, the median error rate is 10.0 percent on the 
top candidate and 2.5 percent on the top two candidates. 

(tt) For deliberately spoken strings, the median error rates with 
known-length strings are 0 percent for reference set Is ® NC.DL ® 
CO.NR, 10 percent for reference set Is, and 2.5 percent for reference set 
Is ® NC.DL. 

(tii) For deliberately spoken strings, the average error rate scores 
for all 18 talkers are significantly larger than the median error rate 
scores. However, when the talker with the highest error rate is omitted, 
ie., only 17 talkers are used), the average and median error rate scores 
are comparable. 

(tv) For normally spoken strings (TS.NR), the median-string error 
rate is 2.5 percent on the top candidate and falls to 1.3 percent on the 
top two candidates using reference set Is ® NC.DL @ co.NR. Using 
reference set Is alone, the median error rate is 13.8 percent on the top 
candidate and 10 percent on the top two candidates. Using reference 
set IS ® NC.DL, the median error rates are 7.5 percent on the top 
candidate and 3.8 percent on the top two candidates. 

(v) For normally spoken strings, the median error rates with 
known-length strings are 2.5 percent, 11.3 percent, and 5.0 percent, for 
reference sets IS ® NC.DL © CO.NR, IS, and IS ® NC.DL, respectively. 

(vi) For normally spoken strings, the average error rate scores for 
all 18 talkers are significantly larger than the median error rate scores. 
Again, when the talker with the highest error rate is omitted (the same 
one as for deliberate strings), the average and median error rate scores 
become comparable. 

From the above set of results, we can draw the following conclusions: 

(t) The inclusion of embedded-digit training led to significant 
improvements in digit-recognition accuracy for both deliberately and 
normally spoken digit strings. 

(tz) Somewhat better recognition accuracy was obtained when the 
string length of the test sequence was known than when no knowledge 
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of string length was used. This is due to the high probability of 
inserting extraneous short digits (the embedded twos and eights) in 
deliberately, and often naturally, spoken strings. 

(tii) The accuracy with which connected-digit strings could be 
recognized can be made essentially independent of the talking rate, 
especially if one can take advantage of knowing in advance the length 
of the digit string. 

(tv) Both coarticulated and noncoarticulated embedded-digit train- 
ing patterns aid in recognizing connected strings of digits. 

(v) There are some talkers (1 of the 18 tested here) for whom all 
the training procedures failed. For this one talker, the string error rate 
exceeded 50 percent on all reference and test sets. No obvious or clear 
explanation is available for this result, except for the fact that all the 
training procedures indicated a high degree of variability in speaking 
digits for this talker; that is, it took the maximum number of iterations 
to obtian each template set. This high degree of variability basically 
implied that no single set of reference patterns could adequately match 
the digits of this talker. Hence, highly inaccurate connected-digit 
recognition resulted. 

The above results and the conclusions drawn from them indicated 
that the improved training procedure provided, in general, robust, 
reliable digit patterns that greatly aided the connected-digit recognizer 
in recognizing connected-digit strings for these talkers at essentially 
any reasonable talking rate. 


4.3 Speaker-independent recognition using only isolated-digit training 


Although the improved training procedure could be incorporated 
into a clustering analysis to provide speaker-independent, embedded- 
digit patterns, such an effort has not yet been undertaken. However, 
to provide a yardstick for comparison, a speaker-independent reference 


Table VIl—Median- and average-string error rates for speaker- 
independent recognition of connected digits 
Number of Best Candidates 


Statistic KL 1 2 3 4 5 


TS.DL 
Median 5.0 12.5 5.0 2.5 2.5 0 
18 Talkers 6.7 14.3 5.3 3.9 3.3 2.4 
17 Talkers 6.0 12.9 4.3 3.2 2.7 1.8 
16 Talkers 5.3 12.0 3.6 3.0 2.3 1.7 
TS.NR 
Median 12.5 20.0 10.0 5.0 2.5 2.5 
18 Talkers 14.2 10.3 10.3 8.1 6.3 5.7 
17 Talkers 12.7 16.8 8.7 6.6 5.0 4.4 
16 Talkers 10.9 14.8 7.5 5.3 3.9 3.4 
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set consisting of 12 isolated-digit templates for each digit was used in 
the LB-based DTW recognizer using TS.DL and TS.NR data. The results 
of these recognition tests are given in Table VII. It can be seen that 
for TS.DL, the average- and median-string error rates for the 18-talker 
population are comparable to those of the speaker-trained sets of 
Table V. (This result is expected since the variability of the 12 patterns 
of each digit is adequate for representing inherent digit variations in 
deliberately spoken strings.) However, for TS.NR, the average- and 
median-string error rates using speaker-independent, isolated-digit 
references are much larger (on the order of 2:1) than those of Table VI 
for the speaker-trained case. These results again demonstrate the 
effectiveness of the embedded-digit training for recognizing digits in 
normally spoken strings. 


V. DISCUSSION 


The results given in Section IV show that for speaker-trained, 
connected-digit recognition, high reliability and accuracy can be ob- 
tained across a fairly broad range of talking rates by combining 
embedded-digit training patterns with the standard isolated-digit 
training patterns. The question that now remains is how useful is such 
a procedure for real-world applications, and what general types of 
problems remain. The answer to the first part of the question seems 
clear. For talkers who are fairly repeatable in the way in which they 
speak digits, both in isolation and in sequences, this improved training 
procedure should make connected-digit recognition a viable procedure. 
For the remaining talkers (e.g., the talker who had substantial digit 
variability), this procedure is too simplistic and cannot adequately 
represent the variations in pronunciation of digits. An analysis of the 
errors made in trying to recognize the strings of such talkers shows no 
consistent error pattern; that is, the method basically breaks down 
because of the high variability in the spoken-digit strings. In a sense, 
the entire training procedure has broken down for such talkers; there- 
fore, the matching procedure is doomed to failure. 

The answer to the second part of the main question as to what 
general types of problems remain is not at all clear. A detailed exami- 
nation of the types of errors that occurred in all the test sets indicates 
a fairly broad mix of digit insertions, digit deletions, and digit substi- 
tutions. It would appear that for most of these errors there is no simple 
fix. In these cases, we are basically at or near the limits at which a 
matching procedure can operate. In such cases, a feature-based, con- 
nected-digit recognizer would appear to offer more promise in that it 
would be less sensitive to parametric variability so long as the detected 
features remain in the acoustic signal. Since all spoken strings were 
highly intelligible to the experimenters, it would seem that the promise 
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of perfect-digit recognition needs a less-sensitive parameterization, i.e., 
one more closely related to phonetic features, to be fulfilled. 


VI. SUMMARY 


An improved training procedure for extracting reference-word tem- 
plates for connected-word recognition systems has been described. 
The resulting reference patterns essentially model the word character- 
istics when embedded in connected-word strings. Hence, a reference 
set consisting of both isolated-word patterns and embedded-word 
patterns has the capability of providing reliable recognition of con- 
nected-word strings spoken at natural rates. In an evaluation of this 
procedure and using a digit vocabulary, it was shown that high-string 
accuracy could be obtained, if the length of the digit string was known 
a priori, for deliberately and naturally spoken digit strings. If the 
length of the digit was not known, the string error rate was somewhat 
higher due to the problem of inserting short digits into the matching 
sequence. 
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To investigate the feasibility of transmitting data bidirectionally 
between 56 and 144 kb/s, an extensive stimulation and experimental 
study was undertaken. The computational techniques for the study 
have been developed and coupled to a large data base, which contains 
the individual loop configurations of a statistically significant pop- 
ulation of the Bell System loop plant. Two modes of transmission (the 
hybrid and the time-compression multiplexing) are investigated thor- 
oughly by computational techniques, and the simulated plots are 
compared with experimentally generated oscillograms. These results 
are displayed graphically by eye diagrams for individual loops and 
by scatter plots for all the loops in the data base. Different varieties 
of scatter plots display different attributes of eye diagram. Broad 
spectrums of such results are cross-compared to choose optimal 
systems and circuit configurations for a particular mode of transmis- 
ston at a particular bit rate. Effects of varying terminating networks 
are also studied and the extent of echo cancellation necessary is 
investigated by studying the amplitude and the shape of near-end 
reflections. Various codes are investigated even though the primary 
emphasis has been on the bipolar code. Major conclusions are sum- 
marized in the concluding remarks. 


Il. INTRODUCTION 


Digital data transmission over existing loop plant facilities has been 
envisioned by numerous organizations around the world. In Japan,” 
initial designs for digital data transmission rates up to 64 kb/s are 
being investigated. In Switzerland, there has been initial interest* to 
provide an overall transmission rate of up to 160 kb/s. In Sweden, an 
80-kb/s burst system, operating at 256 kb/s in 125-ys slots has been 
investigated.* In Norway,” digital pulse-code modulation (pcm) hybrid 
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transmission rates greater than 64 kb/s (up to 80 kb/s) have been 
analyzed. In Great Britain, hybrid 64-kb/s channels have been pro- 
posed®’ by superimposing them over the baseband analog telephone 
communications. Other Western European countries®® (Germany, 
France, and Italy) have started discussion of high-speed digital trans- 
mission. 

In the Bell System, the results of preliminary investigations have 
been reported.” This paper presents the computer simulations and 
the entire cross section of results obtained from the extensive simula- 
tions that have been performed using a loop plant data base in 
conjunction with specific terminal systems. The simulations test the 
functioning of any given system with existing loops from the 1973 Loop 
Survey,” they test the effects of component design variations, and 
variations of such factors as temperature, loop configuration, etc., and, 
finally, they verify the eye opening performance at different bit rates. 

The paper also presents the results of such simulations under two 
major modes of transmission: (7) a full-duplex simultaneous bidirec- 
tional transmission of data with a hybrid and active echo canceler 
(hereafter referred to as the hybrid system), and (zi) the time-com- 
pression multiplex [“‘burst” or the “ping-pong” system (hereafter re- 
ferred to as the TcM system) ]. 

The remainder of the paper covers five major sections. Each section 
addresses a specific issue in the overall design. Section II gives the 
simulation techniques and data base organization. (The Appendixes 
contain the mathematical framework of the computation.) In Section 
III, the electrical characteristics of the loop plant, as they pertain to 
the transmission of data (2.4 to 324 kb/s), are tabulated. Some of the 
physical data are also given to indicate the wide variety of loops in the 
plant. Section IV described the time-domain simulations (with numer- 
ous codes) which yield wave shapes and eye diagrams at the receiver. 
Section V compares the performances of different systems at different 
rates between 56 to 324 kb/s. Results are summarized as scatter plots, 
each showing an attribute of the eye diagrams for all the loops in the 
data base. Section VI describes a computer-aided design technique for 
evolving an optimal equalization strategy. The effects of different 
equalizer designs on the eye openings at the receiver is also docu- 
mented. Finally, the concluding remarks give the overall summary of 
findings. 


ll. TECHNIQUES: SIMULATION, DATA BASE, AND ORGANIZATION 
STRATEGY 


2.1 Introduction 


Any repetitive periodic phenomenon may be analyzed by Fourier 
analysis. To study the transmission system response, we resort to a 
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long periodic sequence (48 pulses) of randomly distributed data. The 
flexibility of analyzing the system performance in the spectral domain 
becomes available before inverting the Fourier series to obtain the 
time-domain response. The process is carried out independently for 
the two directions of data transmission over one pair of wires. Simul- 
taneous bidirectional transmission performance evaluation in the hy- 
brid system entails calculation of reflections arising because of cable 
and termination mismatch at all the harmonics of the overall repetitive 
period. The transmission rate is the line rate. The TcM transmission, 
however, is unaffected by near-end reflections since data transmission 
is multiplexed in the time domain by increasing the transmission rate 
to 2.25 times the information rate. 

To evaluate the system performance on individual loops, it is nec- 
essary to access data bases which store statistical samples of loop 
compositions of the existing loop plant. Thus, the simulation capability 
performs four functions: computations, data base manipulations, gen- 
eration of graphical displays, and provision of a capability for evalua- 
tion and optimization of the system components. The first three of 
these functions need minimal operator intervention, and the last 
function is interactive with the system designer. 


2.2 The Basic concepts in time-domain simulations 


Transmission of digital data is a time-domain phenomenon. How- 
ever, electronic circuitry and cable characteristics are better defined in 
the spectral domain. Hence, to simulate the entire physical process of 
the data transmission, we resort to a limit-cycle’ approach. The entire 
system—the electronic circuitry and the individual loop under study 
is forced into a periodic function in which the boundary conditions at 
the beginning and the end of a predefined period are identical. The 
approach facilitates the transformation of the cyclic time-domain 
phenomenon into its harmonic spectral-domain components by Four- 
ier series. Enough flexibility is provided to alter the mode of excitation 
to correspond to any random or specified pulse pattern within a 
sufficiently long period. 


2.3 Spectral-domain considerations 


Spectral-domain responses are essential subsets that can be isolated 
from time-domain simulations. To provide systematic computations, 
the spectral-domain and time-domain simulations access identical data 
bases and subprograms. The data base generation and display pro- 
grams are separate. They have been completely automated to generate 
and graphically display predefined component and system character- 
istics. 
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2.4 Representational techniques 
2.4.1 Excitation signals 


A 48-pulse period* spans the repetitive cyclic time. Any random or 
prespecified pattern of pulses are transmitted in either direction. Delta 
function or finite-width pulses have been simulated with different 
coding (alternate bipolar, ternary, polar, etc.) options. 

Sixty-four Fourier harmonics represent the pulses. This choice re- 
flects the presence of filters with a sharp roll-off beyond the system 
cutoff frequency. The truncation at 1-1/3 bit rate (i.e., 64/48) has little 
effect on the accuracy of the simulation, but cuts the core requirement 
and execution time. 


2.4.2 Component representation 


Bidirectional data transmission systems generally have a transmit- 
ter, a loop, a filter, an equalizer, a terminating impedance, and a device 
to limit the near-end reflections.’ For the delta functions or finite 
pulse-width multilevel pulses at the transmitter, the excitation func- 
tions are defined in Appendix A. The loop (Section 2.4.3) is character- 
ized by ABCD matrices (one set for each harmonic frequency). The 
filter is represented by standard transfer functions. The equalizer is 
programmed by the distribution of poles and zeros. The matching 
impedance being passive has a simpler characterization. Finally, the 
isolating device which separates the received and transmitted signal is 
represented in the frequency domain (for frequency-domain cancella- 
tion circuits) or in the time domain (for ccD-type transversal filter 
circuits). The technique of the computation of the cancellation signal 
varies appropriately. In the TcM mode of transmission, the contami- 
nating signal is ignored because the transmitter is inactive during the 
reception of the signal and vice versa. 


2.4.3 Loop representation 


The loop constitutes the most variable component in the entire 
system. A summary of Bell System loop statistics has been reassembled 
in a dedicated data base. Composite loops (with numerous loop sections 
and bridged taps) are represented by their individual ABCD matrices 
at each harmonic frequency. The temperature of cable (aerial vs. 
buried) sections can vary drastically. Hence, temperature distributions 
are also permitted in the simulation by an identifier scheme to indicate 
particular lengths which experience the differential temperatures. 


* The 48-pulse period is an arbitrary duration. The limited core capacity of the 
dedicated computer was the primary reason for the choice. More recently, we have been 
able to extend the simulations to 480 pulses and 1200 harmonics. 


t Typical examples are echo cancelers, hybrids, frequency-domain cancellation cir- 
cuits, or time-division switches. 
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The R, L, G, and C parameters, at different temperatures for 
different lengths, are calculated by accessing a separate data base, 
storing the primary parameters” and their temperature variations. 


2.5 Computational techniques 
2.5.1 System components 


Conventional circuit analysis of components is performed by ac- 
cepting critical resistors, capacitors, and transistor data. Noncritical 
components are represented by transfer functions. When the sensitiv- 
ity analysis is performed, individual components are perturbed. When 
the sensitivity of system blocks is analyzed, the transfer functions are 
perturbed. Cascading appropriate ABCD matrices for the loop sections 
leads to composite loop ABCD parameters (see Appendix B). These 
parameters are generated from the central office to the subscriber, and 
the A and D terms are interchanged for the other direction, thus 
reducing the core requirements and execution time. The terminating 
impedance effects are computed by multiplying the ABCD matrix of 
the cable (with adequate attention to the direction of transmission) by 
the ABCD matrix of the terminating impedance. 


2.5.2 Restoration to time domain 


The excitation function has obvious poles at p = 27im/T (m = 1, 2, 
-++, M) and the computational technique for evaluating the normal 
Fourier integral becomes converted to the summation of (MM) harmon- 
ics responses over K pulse periods (see Appendix C). The two high- 
lights before the double summation in eq. (10) (see Appendix C) are 
(t) the entire transfer function of the total system including filters, 
equalizers, and any echo cancelers [i.e., ¢(p), see eq. (6), Appendix C ] 
can be obtained after finding the composite ABCD parameters for 
both directions of transmission, and (ii) the use of L’H6pital’s rule in 
evaluating the residues at the M poles [see eq. (9), Appendix C]. 


2.5.3 Simulation of echo cancellation 


During simultaneous bidirectional transmission of data, the near- 
end echo adds to the received signal within the hybrid. There are two 
ways of simulating the echo-canceler function. The first one consists 
of computing the total signal (reflections and the received pulses), and 
then subtracting the reflection out systematically. The second method 
consists of systematically dropping all, or a large fraction, of the 
frequency-domain reflected signal before summing the Fourier com- 
ponents. 


2.5.4 Echo cancellation signal from single-pulse response 


The reflections of a single-pulse excitation from the Central Office 
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(co) side and then from the subscriber (s) side are stored. There are 
240 time positions covering the 48-pulse period. These responses are 
then shifted by the same amount that any new pulse is shifted from 
the original pulse that gave the first reflection. The response of the 
pulses occupying the 48 time slots of random excitation is obtained as 
a summation of all the responses suitably displaced from the location 
of the original pulse. Since the excitation is assumed to be periodic, 
any overflow beyond the 240 time positions is reaccumulated at the 
beginning of the period. 

The subtraction of the reflected signal from the total signal by the 
echo canceler is carried out by either of two ways: (i) prespecified 
fraction (between 90 to 99 percent) of the total reflected signal is 
systematically subtracted from the total signal at the received end, 
and (ii) a randomly varying fraction averaging to a prespecified 
number (90 to 99 percent) of the total reflected signal is subtracted 
from the total received signal. The reason for the use of a second 
procedure was to reflect the uncertainties caused by the convergence 
in the tap weight setting of a real echo-canceler device. These settings 
can change slightly because of the number of iterations the device has 
experienced at any instant of time. Being an adaptive device, the 
setting varies by small increments around the average value with time, 
and the second computational procedure reflects this uncertainty. For 
the illustrations presented here, the first procedure of echo cancellation 
is used since the second one causes a small random perturbation of the 
data* obtained in the first. 


2.5.4.1 Simplified echo cancellation. The simulation of the echo can- 
celer is done by suppressing the addition of the reflected signal in the 
frequency domain before summing the Fourier components. The in- 
accuracy of this technique is because the time events are only approx- 
imated by the Fourier expansion in view of the truncation of the 
harmonics. These inaccuracies are not consistent in the representation 
of the limit cycle encompassing a finite number of pulses (48 in this 
case) with the inaccuracies in the representation of a single pulse 
whose reflection was determined in the former case. In the latter case, 
only a percentage (90 to 99 percent, depending on the quality of the 
echo canceler) of each of the harmonics of the entire 48-pulse period 
is subtracted. In the former case, the specified percentage of the total 
reflected signal in the time domain was subtracted out. However, the 
simplicity of the simulation procedure has prompted the use of the 
latter method in most cases after determining that the difference of 
the echo suppressions by the two methods does not cause more than 


* Very marginal eyes are occasionally closed, and an occasional error of the marginal 
bit occurs. 
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a 2- to 3-percent change in the eye openings in either the co side or 
the s side. 


2.6 Establishment of permanent bases 


The primary data bases consist of three permanent files: (i) the 
cable characterization, (it) the Loop Survey files holding the loop 
composition of all the 1973 survey loops up to the 18-kft loop without 
bridged taps, and (iz) the files containing the same loops including 
details of bridged-tap locations and composition. 


2.6.1 Access and reuse of selected bases 


Very long computations yield large files, and unless some systematic 
steps are taken to conserve the disc space, results obtained from 
different systems of different components cannot be easily cross-com- 
pared. Hence, certain disc files or a group of files are reused continu- 
ously. A typical group of such files holds excitation pulse patterns and 
their harmonics, filter transfer functions, equalizer transfer function 
(if it is amenable to such a representaion), single-pulse response 
function (if it is to be used in conjunction with an echo canceler), etc. 
It is also necessary for the intermediate computations to be completed 
before the reuse of the allocated disc space. A typical example of such 
a situation is after the generation of the time-domain received pulse 
data from the spectral domain. The eye statistics (eye openings, top, 
central, bottom-eye thicknesses, and the eye height) have to be com- 
puted from a separate program before the files holding the time- 
domain response can be reused. Hence, the use or reuse of intermediate 
data bases needs special consideration in the overall architecture of 
the system. 


2.7 Conclusions 


In Section I, the major highlights of the software simulation system 
are presented. Numerical algorithms, computations, and functions are 
organized independently of the major housekeeping and file manage- 
ment functions. The entire system has been tailored to work efficiently 
and smoothly on dedicated minicomputers. System capabilities range 
from simply analyzing the data for statistical characteristics from the 
data bases to the entire system simulation under data transmission 
and the analysis of failure rates, details of eye closure, and sensitivity 
analysis of components. 


li. FUNDAMENTAL LOOP PLANT STATISTICS FOR DIGITAL DATA 
TRANSMISSION 


3.1 Introduction 


Possible data transmission rates envisioned for the future range 
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from 56 to 324 kb/s. A set of seven bit rates (2.4, 4.8, 9.6, 64, 144, 216, 
and 324 kb/s) is chosen to span the range of interest. The 1973 Loop 
Survey presents a cross section of loops that may be expected to carry 
the data traffic. However, they may be generally limited to a loop 
length of 18 kft because of the presence of loading coils beyond this 
distance. This section presents characterization of the loops in two 
loop configurations: (z) with all bridged taps stripped, and (ii) with all 
bridged taps intact. Image impedances (at the half-bit-rate frequencies 
1.2, 2.4, 4.8, 32, 72, 108, and 162 kHz) are also presented. The physical 
statistical data are presented in condensed form and selected electrical 
parameters are also tabulated. 


3.2 Organization of results (loops with bridged taps stripped) 
3.2.1 Physical characteristics 


Eliminating loops longer than 18 kft, those with loading coils, and 
nonstandard cable sections, about 76 percent (831* loops) of the 1098*- 
loop population surveyed” constitutes the truncated data base over 
which the digital data transmission studies are based. A histogram 
relating the population to distribution of the loops by their length and 
the number of cable section(s) is shown in Fig. la. The typical overall 
composition of an average loop (in the truncated population), which is 
7748.63 ft long, has 4500.59 ft of No. 26 awc cable with 2408.47 ft of 
No. 24 AWG cable, 797.52 ft of No. 22 Awe cable and, finally, with 42.03 
ft of No. 19 awa cable. Figure 1b shows the average composition of 
the loops in each kft band between 0 to 18 kft. 


3.3 Typical results generated at 32 kHz for 64-kb/s rate (loops with 
bridged taps stripped) 

There are three sets of results for the three temperatures (140°F, 
60°F, and 0°F) at which the simulation is done over all the loops in 
the data base. This section gives the typical results at 60°F. In Figs. 2 
and 3, the loop attenuations are shown as scatter plots of the individual 
attenuation of each loop. Figure 3 gives the data when the attenuation 
is plotted against the converted loop length. The conversion of the 
actual loop length to its equivalent No. 22 Awe cable length is done by 
first determining the equivalency numbers between cable length for a 
given attenuation and then by cumulating the length of each loop 
section in terms of its equivalent length. 

In Figs. 4 and 5 the resistive component of the impedance is plotted 
against the reactive component for each of the loops at 32 kHz and 
60°F. Denser clusters of points are detected around the impedance 


* For clarification, “loop survey” refers to the 831-loop data base, and “Loop Survey” 
refers to the original 1098-loop data base. 
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(147-794) ohms of No. 26 awe cable. This result reinforces the domi- 
nance of No. 26 Awe cable also represented in Fig. la. 


3.4 Summary of results (loops with bridged taps stripped) 


At each frequency and at each temperature, the high, low, and 
average values of the most important parameters are tabulated (see 
Table I). In addition, two specific parameters evolve from this com- 
putation: (i) the gauge conversion factor for equalization, and (iz) the 
temperature length equivalency number. The first parameter becomes 
critically important while determining the equalization required to 
compensate for mixed gauge loops at any particular frequency. For the 
digital transmission, we have found that if the conversion numbers 
estimated at half-baud-rate frequency are used consistently at all 
harmonic frequencies of the pulse pattern, the eye diagrams predicted 
after the analysis very closely match those obtained by an experimental 
procedure. Hence, for data transmission at 216 kb/s, 1 kft of Nos. 19, 
24, and 26 Awe cables yield the same loss as 0.648, 1.408, and 1.923 kft 
of the No. 22 awe cable. These results are obtained by plotting the 
loss of the various gauges at 108 kHz, and then proportioning the 
lengths. 

The second conversion number is also obtained by a similar proce- 
dure. The equivalent change in the length, caused by every degree 
change in temperature, is evaluated by plotting the losses at numerous 
temperatures, and then a coefficient of temperature sensitivity is 
obtained by proportioning the length accordingly. 

This coefficient a, which relates the length (2 in feet) at temperature 
t2°F and the length (J; in feet) at temperature ¢,°F, is defined as 


L=h[1+ a(t — th)) ft. 


Coefficient a has been evaluated as 2.185 x 107° per degree F, at 108 
kHz. This number becomes important in determining the equivalent 
length of loops in terms of No. 22 Awe cable at 55°F. 


3.5 Organization of results (loops with bridged taps intact) 
3.5.1 Physical characteristics 


The truncated data base consisting of 831 loops has a total of 1865 
bridged taps. The average loop is 7748.63 ft long, has approximately 
four sections, and has 1.64 bridged taps on it. The length of an average 
tap is 922.42 ft, consisting of about 54 percent of No. 26 awe, 33 
percent of No. 24 Awe, 12 percent of No. 22 Awe, and less than 1 
percent of No. 19 AawcG cable composition. The median distance be- 
tween the co and the first tap is 6523 ft, and the median distance 
between the first and second taps is 1071 ft. Loops between 0 to 1 kft 
have the longest average length of bridged tap—1333.3 ft, with 1.15 
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Table I—Maximum, minimum, and average values of loop impedances at different frequencies (no bridged taps) 


Central Office Side of Loops (Zin) Subscriber Side of Loops (Zin) 
Degrees Maximum Minimum* Average Maximum Minimum* Average 

kb/kHz F Re Im Re Im Re In Re Im Re Im Re Im 
2.4/1.2 140 744 —696 400 —400 649.3 —628.8 720 —544 416 —380 632.0 —634.1 
60 680 —644 376 —376 607.6 —586.9 664 —660 388 —352 593.9 —592.3 
0 632 —616 368 —332 572.7 —552.0 640 —616 352 —344 561.7 —557.0 
4.8/2.4 140 544 —500 272 —270 466.4 —443.1 512 —500 264 —265 446.6 —443.8 
60 504 ~456 256 —256 437.1 —412.5 480 ~—456 264 —240 420.4 —414,4 
0 480 —432 240 —240 412.5 —387.0 456 —344 256 —228 398.2 —389.6 
9.6/4.8 140 400 —365 190 —180 336.6 —311.2 380 —350 180 —190 318.5 —307.1 
60 370 —325 190 —160 316.7 —288.7 360 —325 165 —160 300.4 —286.1 
0 360 —300 180 —155 299.9 —269.8 340 —290 162 —155 285.2 —268.4 
64/32 140 170 —110 97 —22 144.6 —93.1 170 —110 85 —16 138.3 —86.8 
60 170 —101 92 —16 140.4 —83.9 165 —100 85 —-14 134.4 —78.2 
0 168 —94 92 —14 136.9 —76.3 160 —96 85 —12 131.3 —70.9 
144/72 140 145 —67 90 -17 120.4 -51.3 150 —70 90 —8 117.4 —47,.2 
60 145 —60 90 —-15 119.4 —45.3 147 —61 86 —7 116.6 —41.7 
0 145 —55 90 -11 118.7 —40.5 145 —37 83 —6 116.1 —37.1 
216/108 140 134 —54 85 —6 114.1 —37.4 140 —50 89 —6 112.0 —34.7 
60 134 —50 86 —5 114.0 —32.9 140 —46 90 —4 112.1 —30.5 
0 134 —45 86 —4 114.1 —29.3 136 —43 94 -1 112.2 —27.1 
324/162 140 130 —40 86 —6 110.1 —27.4 134 —42 87 —5 108.1 —25.7 
60 132 —34 88 -3 110.6 —24.1 132 —40 86 —3 108.8 —22.7 
0 134 —36 90 —1 111.1 21.5 134 —39 88 —l 109.3 —20.3 


* The low numbers in these columns are not very accurate, since they are not program generated; instead, they are all scaled from the scatter 
plots. 


bridged taps for each loop. The average composition of each bridged 
tap is 59.3, 37.0, 3.7, and 0.0 percent of Nos. 26, 24, 22, and 19 Awe 
cables. The total number of loops in this kft band is 20, and 17 of these 
do have bridged taps. The average loop length is 776 ft. 

Loops between 17 to 18 kft have the highest average number (2.87) 
of bridged taps per loop. The average loop length is 17,331.19 ft, and 
the average bridged tap is only 626.9 ft. The average composition of 
the bridged tap is 10, 74.0, 16.0, and 0.0 percent of Nos. 26, 24, 22, and 


Table Il—Bridged-tap distribution by loop lengths (0 to 18 kft) 


No. of Loops, 
No. of Bridged No Bridged 
Loop Length in Feet Taps Taps No. of Loops 

0-1000 23 3 20 
1000-2000 48 6 38 
2000-3000 82 4 56 
3000-4000 104 13 69 
4000-5000 104 12 64 
5000-6000 115 16 77 
6000-7000 114 5 64 
7000-8000 115 7 72 
8000-9000 121 5 67 
9000-10,000 97 3 53 
10,000-11,000 96 8 55 
11,000-12,000 87 8 52 
12,000~13,000 59 3 38 
13,000-14,000 50 5 31 
14,000-15,000 45 1 26 
15,000-16,000 32 3 21 
16,000-17,000 27 0 12 
17,000-18,000 46 2 16 


Total no. of loops = 831 Loops: No bridged taps = 104 Loops with bridged taps = 727 


Table IIl—Average loop length and average bridged-tap length 


distribution 
Average Loop Average No. of | Average Bridged- 
Loop Length in Feet Length Bridged Taps Tap Length 

0-1000 776.00 1.15 1333.35 
1000-2000 1559.74 1.26 995.58 
2000-3000 2548.87 1.46 782.12 
3000-4000 3544.38 1.51 1121.13 
4000-5000 4539.55 1.62 939.42 
5000-6000 5445.13 1.49 962.14 
6000-7000 6481.23 1.78 740.68 
7000-8000 7483.33 1.60 957.54 
8000-9000 8541.78 1.81 904.93 
9000-10,000 9446.68 1.83 848.39 
10,000-11,000 10572.91 1.75 843.49 
11,000-12,000 11466.83 1.67 1092.56 
12,000-13,000 12435.87 1.55 938.66 
13,000-14,000 13383.77 1.61 932.66 
14,000-15,000 14412.73 1.73 917.27 
15,000-16,000 15447.76 1.52 926.06 
16,000-17,000 16490.17 -f 25 1016.19 
17,000-18,000 17331.19 87 626.91 


Total no. of bridged taps = 1365 Average length of bridged taps = 922.42 
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Table IV—Maximum, minimum, and average values of loop impedances at different frequencies (with bridged taps) 


Central Office Side of Loops (Zin) Subscriber Side of Loops (Zin) 
Temper- 
ature Maximum Minimum Average Maximum Minimum Average 
kb/kHz °F Re Im Re Im Re Im Re Im Re Im Re Im 
2.4/1.2 140 704.8  -—639.7 176.8 —513.2 ' §60.0  —499.2 649.9 —640. 178.3 120.6 497.47 533.24 
60 652.3 —597.5 160.4 -478.3 520.8 —467.7 609.6 —597.5 161.5 -—114.7 A71.19  —497.8 
0 606.8 —562.1 147.8 449.7 488.8 —440.9 575.8 —562.1 148.6  -—109.5 448.5 —468.0 
4.8/2.4 140 520.4 —453.0 123.4 364.3 411.8  —343.1 476.3 —449.9 131.8 —78.8 340.5 —340.47 
60 486.0 —418.9 120.3 337.3 383.6 —321.7 440.7 419.5 123.9 —73.5 323.9 —346.9 
0 456.8 —393.5 117.3 315.3 360.2 —303.4 412.1 -394.4 117.3 —69.5 309.6 —326.6 
9.6/4.8 140 393.1 —324.8 91.5  —259.8 303.3 —230.7 351.6 —314.1 80.4 —57.2 234.5 —252.5 
60 371.5 302.1 83.3 —239.0 284.2 —215.8 331.0  —292.6 78.6 —49,2 223.9 —236.5 
0 349.4 —281.4 77.6  —221.83 268.4 —203.2 313.2 —274.3 78.3 —44.5 214.9 —222.8 
28/56 140 204.4 —129.7 48.8 —97.2 150.3. —100.7 198.8  —128.9 31.2 —9.5 106.6 —82.4 
60 198.7  -—119.5 44.7 —87.3 145.2 —82.4 188.9 —-—117.6 30.5 —7.0 103.8 —76.0 
0 193.9  -—112.4 41.8 -79.1 141.1 —67.1 178.0 —109.2 30.2 —5.4 101.6 —70.6 
32/64 140 196.0  -120.1 47.1 —88.9 144.3 —99.1 193.2 —121.0 28.9 —4.3 102.3 —74.7 
60 194.7  -—112.2 43.8 —79.6 139.9 —82.0 188.3 —112.2 29.1 —0.9 99.8 —68.7 
0 192.9  -105.5 40.3 —71.9 136.4 —67.0 181.3 —103.2 28.6 +2.6 98.0 —63.7 
40/80 140 187.8 —108.9 44,7 -—76.1 135.7 —95.2 177.9  —105.0 25.5 +3.1 96.1 —63.2 
60 189.8 —99.4 43.9 —67.7 132.3 —82.2 181.3  —100.5 24.9 +8.9 94.2 —57.8 
0 189.6 —94.8 41.8 —60.8 129.7 —70.2 183.1 —96.6 24.8 +15.3 92.9 —53.4 
72/144 140 187.1 ~-101.7 40.3 —49.3 120.5 —67.5 157.6 —84.3 21.5 +27.8 85.2 —39.7 
60 190.5 —98.4 40.1 —43.2 119.4 —62.4 164.1 —84.5 21.3 +43.7 84.6 —35.9 
0 192.0 —93.0 40.1 —38.3 118.7 —57.6 175.3 —85.7 21.2 +54.6 84.2 —32.8 
96/192 140 206.7 —79.7 38.6 —39.5 115.8 —49.6 180.3 —84.1 19.4 +49.7 82.6 —31.5 
60 233 —80.2 38.6 ~34.5 115.4 —49.6 186.8 —83.5 20.3 +54.4 82.5 —28.1 
0 253.3 —81.1 38.7 —30.5 115.2 —50.6 200.7 —85.1 18.9 +56.8 82.4 —25.4 
108/216 140 212.2 —-105.5 37.9 —36.1 114.4 —38.0 180.1 —82.7 16.3 +59.4 81.8 —29.0 
60 198.8  —111.5 38.0 —31.5 114.2 —36.8 194.0 —81.4 16.5 +68.5 82.0 —25.9 
0 201.8 —114.2 38.1 —27.8 114.2 —37.5 203.1 —84.4 16.8 +74.9 82.1 —23.3 
162/324 140 216 —119.7 35.1 —26.7 110.2 —34.6 184.8  —100.6 15.2 +745 78.5 —22.4 
60 233.4 —133.4 35.4 23.4 110.7 —27.9 213.6 —105.3 14.0 +81.4 79.2 —20.3 


0 261.5 —145.5 35.6 —20.8 111.2 —19.8 242.0 -108.1 13.3 +88.0 79.8 —18.6 


Table V—Maximum and average attenuation of 


loops 
Attenuation in dB 
Temperature 
kb/kHz (°F) Maximum Average 
2.4/1.2 140 11.7 4.93 
60 10.7 4.49 
0 9.9 4.13 
4.8/2.4 140 15.6 6.92 
60 14.1 6.29 
0 13.3 5.78 
9.6/4.8 140 21.5 9.63 
60 19.5 8.73 
0 18.0 8.01 
28/56 140 45.5 19.93 
60 41.0 17.75 
0 37.4 16.02 
32/64 140 47.6 20.82 
60 42.9 18.50 
0 39.1 16.66 
40/80 140 50.6 22.27 
60 45.6 19.72 
0 41.7 17.72 
72/144 140 60.0 26.30 
60 53.3 23.22 
0 47.9 20.84 
96/192 140 66.2 28.46 
60 60.5 25.21 
0 56.4 22.73 
108/216 140 69.1 29.44 
60 61.9 26.13 
0 56.4 23.62 
162/324 140 76.0 33.15 
60 69.0 29.68 
0 63.5 27.08 


19 AwG cables, respectively. Between these extremes (shortest loops 
with longest bridged taps and longest loops with shortest bridged taps) 
lies a population whose average bridged tap length is fairly consistent. 
Even though the population density is sparse, the in-extreme bands of 
length (0 to 1 kft and 17 to 18 kft), the population density in the 
intermediate bands (Table II) is relatively high and the statistical 
information derived is dependable. These statistics are tabulated in 
Tables II and III. The percentage of loops having bridged taps (20, 
>1, =2, etc.) is shown in Fig. 6. Figure 7 (compare against Fig. 1a) 
displays the gauge distribution in the bridged tap. These distributions 
indicate similar patterns with a dominance of No. 26 awe cable, which 
is gradually replaced by No. 24 awa cable as the length of the loop 
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LOOP LENGTH IN KILOFEET 
Fig. 1b—Distribution of gauge numbers with loop length. 
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ATTENUATION IN DECIBELS 





LOOP LENGTH IN KILOFEET 


Fig. 2—Scatter plot of loop attenuations plotted against the loop length at 32 kHz 
and 60°F. The X-scale is 1 kft per division and Y-scale is 5.0 dB/division. 


increases. Numbers 22 and 19 AwG cables do not dominate either the 
bridge tap or the loop lengths. 


3.6 Summary of results (loops with bridged taps intact) 


The loops in the survey have been simulated at three temperatures 
(140°F, 60°F, and 0°F). However, the discrete frequencies have been 
expanded to include (28, 40, and 96 kHz for 56, 80, and 192 kb/s rates). 
A summary of the image impedance at the ten discrete frequencies 
(1.2, 2.4, 4.8, 28, 32, 40, 72, 96, 108, and 162 kHz) is tabulated in Table 
IV. Table V gives a summary of loop attenuations. 


3.7 Conclusions 


The loop plant environment is far from the ideal situation where 
uniform gauge wires run from the co to the s. Any design of the 
bidirectional data transmission facility has to accommodate the wide 
disparity of cable compositions, bridged-tap configurations, and highly 
variable impedances looking from either direction of the loop. The 
range and statistics of these variations have been presented in this 
section as tables and scatter plots for use in the design of terminal 
equipment of digital subscriber lines. 
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Fig. 3—Scatter plot of loop attenuations plotted against the equivalent loop length of 
No. 22 awe cable. The frequency is 32 kHz and the temperature is 60°F. The X-scale is 
2 kft/division and Y-scale is 5.0 dB/division. 
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Fig. 4—Scatter plot of loop resistance plotted against its impedance (Central Office 
side). Temperature = 60°F, Frequency = 32 kHz, X- and Y-scales = 20 ohms/division. 
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Fig. 5—Scatter plot of loop resistance plotted against its impedance (subscriber side). 
Temperature = 60°F, Frequency = 32 kHz, X- and Y-scale = 20 ohms/division. 
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Fig. 6—Percentage of loops ( Y-axis) having bridged taps equal to or greater than the 
bridged-tap number (X-axis) plotted. 
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Fig. 7—Distribution of percentages of gauge sizes in the bridged taps. 


IV. TIME-DOMAIN SIMULATIONS 
4.1 Introduction 


In this section, we present the time-domain results of typical hybrid 
data transmission systems, typical time-compression multiplexing sys- 
tems with bipolar, ternary, partial response, and WAL-2 modes of 
coding. Single-pulse responses, pulse shapes, and eye diagrams are 
presented over a wide range of loop and terminating configurations. 

The time-domain response of a transmission system is critically 
important in evaluating its capabilities to carry digital data. Such a 
response can be obtained from the spectral-domain response of the 
system by using conventional or fast Fourier transforms. However, 
FFTs were not used due to limited core capacity of the dedicated 
minicomputer on which the simulations were performed. The basis for 
obtaining the time-domain response of the entire system has been 
summarized in Section II. 


4.2 Performance of the computational model 


The flexibility of the software system to permit the simulation of 
any linear system configuration has been indicated (see Section II). 
Here, a hybrid duplex system with two electronic terminations having 
two four-section, sixth-order loss equalizers, (with equalization tailored 
for each loop) has been simulated. 

When the cable length is made vanishingly small, typical component 
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(especially filter) performances and the validity of the model may be 
verified. Further, the delay through the filters may also be accurately 
determined. To establish these facts, two results are presented. 

In Fig. 8a, a sequence of random pulses is used to excite the system 
from the co to the s side. Forty-eight pulse positions are plotted here. 
The first 16-pulse positions are shown in the top one-third of the figure 
(see the second horizontal line from the top). The second and third 16- 
pulse positions are plotted on the fourth and sixth lines of the figure. 
Also, the second (+1)-pulse position (20.83 ys)* after the start of the 
48-pulse duration is the response of the first (+1) pulse at the trans- 
mitter end. 

In Fig. 8b, the eye diagram corresponding to the pulse train in Fig. 
8a is shown. There are seven lines constituting the eye diagram. The 
two outermost lines are formed by 0, 1, 0 and 0, —1, 0 transitions. The 
two intermediate inner lines are formed by —1, 1, —1 and 1, —1, 1 
transitions, and finally the three innermost lines are formed by 1, 0, 
—1; —1, 0, 1; and 0, 0, 0 transitions of the pulses. A pulse diagram 
similar to that shown in Fig. 8b for the S-to-co direction of transmission 
is also obtained. 


4.3 Single-pulse response over loops 


Single-pulse response is essential to judge the capabilities of the 
overall system. Three crucial results emerge from the single-pulse 
response study: the deterioration of the received pulse, the extent of 
the long decaying tails left behind by the pulse, and the magnitude 
and the shape of reflections of the sent pulse at the sending end. 
Component values affecting these responses can then be suitably 
adjusted if the results prove unsatisfactory. 

Single-pulse responses from single and multiple sections have been 
routinely computed, examined, and verified. Typical results from a 
three-section composite cable loop are presented in Figs. 9a through 
9d. The loop consists of 3133 ft of No. 26 awa cable, together with 
8367 ft of No. 22 AwG cable, in series with 1365 ft of No. 19 awe cable. 
The long tail of the pulse (Fig. 9a) received at the s side indicates that 
the following pulses would be offset from their zero position because 
of the influence of prior pulses. Next, consider the shape of the reflected 
pulse (Fig. 9b) on the s side. There are three peaks embedded in this 
pulse shape, and the central peak being almost as high as the received 
pulse would be a source of severe interference. Unless the reflections 
are substantially cancelled out by an echo canceler (see Section 4.4), 
the eye diagram would be considerably distorted. 


*The bit rate for this particular simulation is 144 kb/s, and the delay can be 
computed by measuring the distance of the peak of this pulse (+1) from the instant ¢ 
= 0 at the left-hand corner of the figure for the first 16 pulses. 
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Next, examine Figs. 9c and 9d. Here, the problem is exaggerated 
because of the relative magnitude of the received and reflected pulses, 
the ratio of these being 1:2.84. If it is desired to keep the pulse 
distortion down to 5 percent of the received-pulse height, the minimum 
amount of cancellation for the reflected signal should be 35 dB. 
Advancing one stage further, the single-pulse response can also predict 
that the nature of interference from a bipolar-pulse sequence 1, —1, 1, 
—1 is going to be different from the nature of interference caused by a 
pulse sequence 1, 0, 0, —1. In the former case, the positive half of the 
reflected pulse tends to become additive to the leading negative 
reflection of the next pulse, thus increasing the demands on the 
cancellation to about 41 dB. In the latter case, the reflections do not 
become additive because of the intermediate zeros in the pulse se- 
quence. Similar considerations exist for the many different coding 
algorithms that can be used for transmission. 


4.4 Random-data response 


Random-data responses have been routinely generated to test the 
capability of the system over numerous loops. For the sake of consist- 
ency, results from the simulation of the loop in Section 4.3 are pre- 
sented under this type of excitation. A random-pulse sequence of 48 
bits is generated by selecting random bits of random numbers gener- 
ated by the computer. ; 

Further, we have been able to document excellent correspondence 
between computed eye diagrams and those from experimental results 
over a large number of loop configurations and bit rates. An example 
of this correspondence is evident between Figs. 10a and 10b. The 
experimentally generated eye (Fig. 10a) is obtained by transmitting 
144-kb/s data over 18 kft of No. 22 awe cable with 35 dB of echo 
cancellation (see Section 2.5). The eye opening is 74.7 percent. The 
computationally generated eye (Fig. 10b) is also obtained for the same 
conditions, and the eye opening is 74.8 percent. The eye opening is 
defined as the average of the top and bottom eye openings as a 
percentage to the average of the top and bottom eye heights. 


4.4.1 Bipolar coding (hybrid transmission) 


In the hybrid mode of transmission, the quality of the eye diagram 
depends on the extent of echo cancellation. To illustrate this point, 
Fig. 1la shows the eye diagram at the co side of the loop, with no echo 
cancellation at 144-kb/s on a typical loop (3133 ft, No. 26 awe; 8367 ft, 
No. 22 awa; and 1365 ft, No. 19 Awe cables). Figure 11b shows the eye 
with perfect echo cancellation. Finally, Fig. 11c, shows an eye diagram 
with 30-dB echo cancellation. 
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4.4.2 Bipolar coding (time-compression multiplexing) 


The transmission rate is increased to 324-kb/s for an effective 
bidirectional rate of 144 kb/s. The eye diagrams in this mode of 
operation are obtained by suppressing all reflections in the frequency 
domain, since only a unidirectional flow of data is present at any one 
instant. Figure 12 shows the eye diagram at the co side of the loop. 


4.4.3 Ternary coding 


Simulations of ternary coding were limited because of the increasing 
degradation of the eye diagram. Most of the eyes were closed with 
composite cable loops and, hence, the results from a loop with 18 kft 
of No. 22 awa cable, with 100 percent echo suppressions are given in 
Figs. 18a and 13b. Because of the symmetry of the loop, the correspond- 
ing figures for the other side of transmission are substantially the same 
as those in Figs. 18a and 13b; however, minor differences because of 
changes in clusters of bit patterns in the two 48-bit sequences (one 
from the co side to S side and the other from the S side to Co side) are 
present. The ripple between bits —1, —1, —1, --- and 1, 1, 1, --- was 
primarily responsible for the thickening of the eye in Fig. 13b. Further, 
long tails are also observed at the end of a sequence of data. 


4.4.4 Class four partial response coding 


Class four partial response coding’® yields wave shape and eye 
diagram (Figs. 14a and 14b) considerably different from those with 
bipolar coding. Two distinct degradations occur. In the Y direction, 
the eye opening relative to the peak tends to suffer more because of 
the relative displacement of the peak of the eye relative to the peak 
eye opening. In the X direction, the horizontal eye opening tends to 
become limited because of the limited response of the system—the 
major advantage being that the system bandwidth is about one-half 
the bandwidth for bipolar coding. The particular loop over which the 
data transmission was simulated does not have any bridge taps but is 
composed of three cable sections as discussed in Section 4.3. 


4.4.5 The WAL-2 codes 


The WAL-2 codes also limit the bandwidth necessary for transmis- 
sion.’’ The received data wave shape and eye diagram (Figs. 15a and 
15b) also differ from those with the earlier types of codes. Here, the 
limited system response generates slowly varying pulse shapes (Fig. 
15a). Generally, these codes are robust-to-line discontinuities but suffer 
in the presence of bridged taps. For the simulations given here, the 
loop was approximately 11 kft long, with nine cable sections and three 
bridged taps. 
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4.5 Time-compression multiplexing results with modified terminal 
configurations 


The equalizer used for simulations in Section 4.4 has the capability 
to be tailored to the line loss by cascading up to three sections of sixth- 
order equalizers. Each of these sections provides for the loss of about 
6000 ft of No. 22 awe cable at 144 kHz. However, this particular 
approach has been abandoned in favor of an extended range equalizer 
(see Section 6.5). The circuitry for this equalizer has been adjusted 
such that there is a clear match between the attenuation of cables at 
different frequencies and the corresponding equalizer gain. Next, to 
simplify the termination impedance, the matching circuitry is substi- 
tuted by a single resistance, which approximates the statistical average 
of the loop impedances at half the baud rate (see Table I, Section III). 
Further, it has been computationally determined (and experimentally 
verified) that the harmonics of the transmitted signal can be ade- 
quately controlled by adjusting the pulse width and, thus, saving the 
cost of an additional filter. Hence, the time-compression multiplexing 
simulations differ from the hybrid system to the extent that 

(t) the bit rate is 2.25 times the bidirectional hybrid transmission 
bit rate, 
(ii) the echo cancellation function is absent in the TCM mode, 
(iii) the initial filtering of transmitted pulse is eliminated, 
(iv) the pulse width has been computationally optimized at about 
40 percent of the pulse period, and 
(v) the matching circuit for all the cables has been approximated 
as the statistical average of all the resistive components of the 
image impedance of the cables at a frequency equal to half the 
baud rate. 

The advantages of the hybrid system inherent in maximizing the 
bandwidth utilization and the impulse noise immunity have to be 
weighed against the complexity of the echo canceler necessary to 
eliminate the reflections and also against the enhanced NEXT inter- 
ference (over that in the TcM system) into other cable pairs. Con- 
versely, the simplicity of the design and construction of the time- 
compression multiplex system has to be weighed against the higher 
loop losses and increased impulse noise susceptibility caused by in- 
creased bandwidth. 

In Figs. 16a and 16b, the experimentally and computationally gen- 
erated eye diagrams are compared for 14 kft of No. 24 awG loop at 324 
kb/s. The eye opening predicted by the simulation is 85 percent, and 
the experimentally determined eye opening is 83 percent at an instant 
ti, and is 76 and 75 percent, respectively at instant ¢2. Similar results 
are obtained for typically mixed cable composition in Figs. 17a and 
17b for a loop consisting of four sections. The No. 19 Awe cables at the 
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end of the loop make the configuration close to the worst type of cable 
composition. The experimentally determined eye opening is approxi- 
mately 54 percent and the computationally predicted eye opening is 
57 percent. 


4.6 Effects of bridged taps 


Bridged taps play a critical role in judging the capability of existing 
loops to carry digital data. At a particular bit rate, the number, lengths, 
and location of taps influence the pulse shapes and, consequently, the 
eye diagrams. For the simulation presented in this section, a No. 22 
AWG, Single-cable 10.5-kft loop with perfect echo cancellation is used. 
The results of the simulation are presented in Figs. 18a through 18d 
for zero-, one-, two-, and three-bridged taps, respectively. In Fig. 18a, 
there are no bridged taps on a 10.5-kft, No. 22 AwG loop. The bit rate 
is 144 kb/s, with perfect echo cancellation for all the simulations. In 
Fig. 18b, one bridged tap (No. 22 awa, 1.5 kft long) is added 2 kft away 
from the co end of the loop. In Fig. 18c, a second bridged tap (No. 22 
AWG, 1.5 kft long) is added 1.5 kft away from the first bridged tap. In 
Fig. 18d, a third bridged tap (No. 22 awe, 1.5 kft long) is added 1.5 kft 
away from the second bridged tap. The co-to-s side eye diagrams for 
_ the four cases* are shown. 

The results for other gauge cables are similar, but the results for 
composite cables with bridged taps are dramatically inferior to those 
presented here because of mismatch (and hence, additional eye clo- 
sure) between the loops and terminating impedances. 


4.7 Effects of pulse widths 


Harmonic content of the transmitted signal may be controlled by 
filtering delta function excitation pulses or by altering the pulse width 
of the transmitted signals. The Tcm system is a special beneficiary of 
the latter method (thus reducing the cost of an additional filter) 
because of the absence of received data during transmission. This 
makes the system more robust against NEXT interference (if the 
transmission and reception of data is synchronized). 

The effect of changing the pulse width was studied with a particular 
TCM system. The simulation was carried out on a 17,314-ft eight-section 
loop from the 1973 loop survey data base. The composition of the loop 
is 7789 ft of No. 26 awe; 2998 ft of No. 24 awa; 2564 ft of No. 26 AWG; 
25 ft of No. 24 awe; 957 ft of No. 26 awe; 2120 ft of No. 24 awe; 141 
ft of No. 26 awa; and 720 ft of No. 24 awe cables. From our experience, 
this particular loop had the worst eye opening carrying 


* Overall optimization in the system design yields significantly better eye openings 
(see Sections VI and VII) in spite of bridged taps. In this Section, eye degradations 
caused by bridged taps in one of the early systems is shown. 
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324-kb/s data at 60°F in the TcmM system. The results of changing the 
pulse width are shown in Fig. 19, showing a broad insensitivity to pulse 
width in the 20- to 40-percent duty cycle range for the particular 
equalizer simulated. This test has prompted the use of 40-percent duty 
cycle in the subsequent TcM simulations. 


4.8 Conclusion 


Simulations of the two general configurations for the bidirectional 
digital line have been accomplished. The system of time-domain 
programs is capable of simulating most of the physical conditions that 
exist on loops. Effects of temperature, code variations, component 
tolerances, and loop configurations on the eye diagrams have been 
routinely analyzed. Proposed systems are entirely simulated, and the 
effectiveness of each component on the eye opening is individually or 
combinatorially determined. The presence of bridged taps and their 
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Fig. 8—System characteristics with zero loop length. (a) Pulse pattern and shape at 
the receiver. (b) Eye diagram at the receiver. 
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critical location in the loops is analyzed by a series of simulations. 
Crosstalk and impulse noise effects are not included in the simulation 
reported here. 

The execution time of the programs has been about 20 minutes on 
a Data General Nova 2 minicomputer and about 50 seconds on a Data 


General Eclipse S-250 computer. It also translates to about 10 seconds 
on the IBM 370. 





Fig. 9—Transmission and reflection of a single pulse on a typical loop consisting of 
three cable sections. Pulse rate is 144 kb/s, and each box width corresponds to 1-pulse 
duration. (a) Received pulse at s end of loop. (b) Reflected pulse at s end of loop. 
(c) Received pulse at co end of loop. (d) Reflected pulse at co end of loop. 
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Fig. 10—Experimentally (a) and computationally (b) generated eye diagrams by 
transmitting bidirectional data over 18 kft of No. 22 Awe cable. The echo cancellation 
is 35 dB. 
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Fig. 11—Eye diagrams obtained a transmission of 144 kb/s bidirectional data over 
a typical loop. In (a) there is no echo cancellation, in (b) there is perfect echo- 
cancellation, and in (c) there is 30-dB echo cancellation. 
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Fig. 12—Eye diagram obtained by transmission of data at 324 kb/s (for an effective 
bidirectional rate of 144 kb/s) in the TcM mode. The loop configuration is the same as 
that for the simulations in Fig. 11. 
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Fig. 13—Pulse pattern and eye diagram obtained by ternary code at 144 kb/s. The 
loop is 18 kft of No. 22 awG cable. (a) Subscriber-side pulse pattern. (b) Subscriber-side 
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Fig. 14—Pulse pattern and eye diagram obtained by class four partial-response coding. 
The toon is a typical three-section loop, and the bit rate is 144 kb/s. (a) Subscriber-side 
received data. (b) Subscriber-side eye diagram. 
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Fig. 16—Experimentally (a) and computationally (b) generated eye diagrams over 14 
kft of No. 24 awe cable. The bit rate is 324 kb/s, and the half-bit rate loop attenuation 


is approximately 39 dB. 
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Fig. 17—Experimentally (a) and computationally (b) generated eye diagrams over a 


four-section (3.46 kft, 19 AwG; 8.7 kft, 24 awe; 9.8 kft, 22 Awa; 3.46 
324 kb/s in the Tcm mode. The loop attenuation at half-bit rate is approximately 48 dB. 
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Fig. 18—Degradation of the eye diagram with an increasing number of bridged taps. 
(a) Subscriber-side eye diagram, no bridged taps. (b) Subscriber-side eye diagram, two 
Fig. 19—Effect of varying the pulse width on the eye opening of 17,314-ft loop 


bridged taps. (c) Subscriber-side eye diagram, two bridged taps. (d) Subscriber-side eye 
composed of eight discrete cable sections with a typical equalizer (see Section 5.5). 


diagram, three bridged taps. 
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V. SUMMARY OF 1973 LOOP SURVEY SIMULATIONS UNDER VARIOUS 
TERMINAL CONFIGURATIONS 


5.1 Introduction 


The time-domain simulations yield pulse shapes and eye diagrams. 
The loop plant, however, consists of a widely diverse collection of 
loops (see Section III). Thus, we have been able to establish a com- 
putational link between the simulation programs and the loop survey 
data base such that the program systematically simulates the data 
transmission with any prescribed terminal configuration loop by loop 
on a statistically significant population of the Bell System loops. 


5.2 From eye diagrams to eye statistics 


Two hundred and forty equally spaced instants of time span the 48- 
pulse period over which the 48 random binary pulse data are trans- 
mitted. Hence, the pulses are approximated as a sequence of five 
straight lines (in the time domain). Out of the 240 instants of time for 
scanning 48 ones, zeros, or minus ones, 144 instants are selected for 
the scanning of the 48 data bits. The first set of 48 instants contains 
the instant at which the absolute maximum value of the incoming 
magnitude is located. The second set of 48 instants precedes the first 
48 instants by one-fifth the pulse duration, and the third set follows 
the first 48 instants by one-fifth the pulse duration. 

Next, for each of the three sets of 48 instants at which the scanning 
can be done, the data are arranged in diminishing order of magnitude. 
Three clusters of data are generated: (i) at the upper level, correspond- 
ing to the plus ones received in the data; (iz) at the intermediate level, 
corresponding to the zeros received; and (iii) at the lower (minus) 
level, corresponding to the minus ones received. From these three 
clusters seven eye statistics can be generated. In Fig. 20, the three 
scanning instants ¢,, t2, and fs, and the five data clusters, representing 
the distances a;b;, b;¢;, c:d;, die;, and e; f; (it = 1, 2, or 3), are indicated. 
Next, a,b; expressed as a proportion of the average eye height consti- 
tutes the top eye thickness, b;c;, cid;, die:, and e;f;, also expressed as 
proportions of the average eye height, constitute the top eye opening, 
the central thickness, the lower eye opening, and the bottom eye 
thickness, respectively. Finally, the positive average eye height and 
negative average eye height constitute the sixth and seventh pertinent 
eye statistics from the eye diagram at each of three scanning instants 
t; (i = 1, 2, or 3). However, the scanning instant ¢; is selected to have 
the maximum average top and bottom eye opening leading to a unique 
scanning instant and a unique set of seven statistics stored away for 
each eye. Two such seven-parameter, eye-statistical sets for each of 
the two eyes generated by data from co to s and from s to Co are thus 
stored for each loop. 
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5.3 From eye statistics to scatter plots 


The eye statistics of loops in the data base are generated and stored 
under a predefined set of terminal conditions. These eye statistics can 
now be displayed as a collection of scatter plots, each dot denoting a 
particular eye statistic of a particular loop. The top and bottom eye- 
opening statistics are assembled in one scatter plot. Along the X-axis, 
the physical length of the loop or the equivalent length of No. 22 awe 
cable (or, equivalently, the loop attenuation) may be plotted. The top 
central and bottom eye thickness may also be plotted on one scatter 
plot. With two independent directions of transmission, a set of eight 
scatter plots are obtained for each transmission scheme. The next set 
of plots is obtained by separating out the loops by the number of 
sections. Here, the X scale is plotted by the length of the loop but 
compressed by two-thirds to accommodate loops in the (1, 2, 3), (4, 5, 
6), or (=7) sections and up to 18,000 ft on each plot. A typical example 
of the entire set consists of 24 scatter plots. Here, we present six such 
plots (Figs. 21a through 21f) obtained by transmitting data at 80 kb/s 
in the TcM mode at line rate of 180 (2.25 x 80) kb/s. In Fig. 21a, the 
loop length is shown on the X-axis. The eye thickness (see Section 5.2) 
data are plotted in the Y direction. In Fig. 21b, the eye thickness is 
plotted against the equivalent length of the loop in kft of No. 22 awe 
cable at 55°F. At half the bit-rate frequency of 90 kHz, each division 
indicates about a 3.3-dB loop loss. In Figs. 21c through 2le, the eye 
thickness data from three sets of loops with 1-, 2-, or 3-cable sections, 
4-, 5-, or 6- cable sections, and finally 7-, 8-, or =9-cable sections are 
plotted. 

To study the effect of the accuracy of equalization, we have gener- 
ated the next family of scatter plots which indicates the eye height 
(also as a scatter plot), together with the eye opening and eye thickness 
data. In Fig. 21f, the effect of the eye height (which is allowed to 
change by approximately 20 percent by adjusting the equalization) on 
the eye thicknesses is plotted as scatter plots in the top half of the 
plots. In the lower half, the eye height data are plotted. 


5.4 Worst loop tabulations 


The scatter plots are a collection of points with each trio (Fig. 21f) 
or quad (Figs. 21a, etc.) representing a loop. However, it becomes 
necessary to identify the loops that do indeed cause failure or unex- 
pected eye closures. For this reason, a set of programs has been 
developed which identifies and tabulates the loops whose eye openings 
lie between any two predefined percentages for any terminal configu- 
ration. For example, the composition of loops that have a =25 percent 
eye closure is shown in Table VI. 
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Table VI—Data of loop that exhibit more than 25 percent eye closure 


WORST LOOP SEARCH-LOOP DETAILS— 
EYE CLOSURE = 25 PERCENT 
130, 5, 26, 2, 
4671, 24, 5348, 22, 671, 26, 759,19, 4122 22, 


2, 
2512, 22, 2333, 24, 4238, 22, 573, 24, 7292, 22, 190, 24, 


456, 2, 29, 2, 

13643, 22, 724, 26, 
699, 3, 28, 2, 

13308, 26, 1412, 24, 990, 26, 
740, 2, 25, 2, 

11523, 22, 6224, 24, 


’ 2, 
7789, 26, 2998, 24, 2564, 26, 25, 24, 957, 26, 2120, 24, 141, 26, 


765, 5, 29, 2, 
10694, 22, 153019, 1638, 24, 303, 26, 2477, 24, 
R 


Note: The first number (130) indicates the loop number, the second number (5) 
indicates the number of cable sections. The third and fourth numbers indicate the 
percentage eye closure (26) and the direction of transmission (2 for s to co and 1 for co 
to s). The number pairs on the next line denote the cable length and the gauge number. 
Five such numbers are printed because there are five cable sections. The same data 
format repeats for the next six loops. 


5.5 Percentage failure analysis 


An alternate way to investigate the success or failure of any terminal 
configuration is to identify the percentage of loops in each kilofoot 
band that has eye closure in excess of the prechosen eye openings. 
Typically, eight eye-closure (2.5, 5, 7.5, 10, 12.5, 15, 20, and 30 percent) 
limits are defined, and there are eighteen 1-kft bands in the loop data 
base. Results for the 80-kb/s TcM mode transmission is shown in Table 


Table Vil—Percentage of loops in each kilofoot band capable of 
carrying data with less than the specified eye closure (CO-to-S-data) 
Percentage (Eye Closure) 


2.5 5 7.5 10 12.5 15 20 30 
0-1 0.00 0.00 0.00 0.00 0.00 100.00 100.00 100.00 
1-2 0.00 0.00 0.00 0.00 7.89 100.00 100.00 100.00 
2-3 0.00 0.00 0.00 0.00 89.29 100.00 100.00 100.00 
3-4 0.00 0.00 0.00 46.38 98.55 100.00 100.00 100.00 
4-5 0.00 0.00 156 96.87 100.00 100.00 100.00 100.00 
5-6 0.00 0.00 54.55 96.10 100.00 100.00 100.00 100.00 
6-7 0.00 0.00 59.37 92.19 98.44 100.00 100.00 100.00 
7-8 0.00 1.39 52.78 95.83 100.00 100.00 100.00 100.00 
8-9 0.00 0.00 10.45 89.55 94.03 98.51 100.00 100.00 


9-10 0.00 0.00 7.55 73.58 86.79 100.00 100.00 100.00 
10-11 0.00 5.45 40.00 56.36 70.91 89.09 100.00 100.00 
11-12 0.00 28.85 40.38 55.77 61.54 73.08 100.00 100.00 
12-13 0.00 34.21 52.63 63.16 68.42 76.32 89.47 100.00 
13-14 0.00 9.68 32.26 54.84 67.74 70.97 80.65 100.00 
14-15 0.00 15.38 26.92 50.00 57.69 73.08 92.31 100.00 
15-16 0.00 9.52 23.81 42.86 47.62 57.14 66.67 100.00 
16-17 0.00 ~=0.00 50.00 83.33 83.33 83.33 83.33 100.00 
17-18 0.00 6.25 25.00 37.50 43.75 50.00 62.50 100.00 
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VII for the co-to-s eye data. A totally perfect system would have a 
100-percent result for each of the elements of the table, and a totally 
unacceptable system will have zeros for all the elements. Thus, the 
higher numbers toward the center of the table would tend to reflect 
the compromise in the designing terminal equipment and its cost. If 
the criterion is for 90 percent of the loop population to have 80 percent 
or better opening, the average of the number in the eighth column 
should be greater than or equal to 90, etc. 


5.6 Simulation data for typical 64- and 144-kb/s TCM and hybrid system 
with and without bridged taps 

Each simulation generates a large number of scatter plots and 
associated tables. This section gives the highlights of eight such 
simulations (Figs. 22 through 29). In Figs. 22a and 22b, the eye-opening 
and eye-thickness data for a 64-kb/s hybrid system are plotted with 
30-dB echo cancellation. Figure 22 pertains to co-to-s data transmis- 
sion and similar results (Figs. 23 through 29) are given under the other 
conditions. 


Table VIII—Loop percentage failure and shortest length for 


failure 
With Bridged 
Rate Mode No Bridged Taps Taps 
64 kb Hybrid 9.0%, 12 kft 31.2%, 5.7 kft 
TCM 0.0%, 18 kft 2.0%, 2.7 kft 
144 kb Hybrid 8.4%, 9 kft 46.5%, 2.5 kft 
TCM 0.0%, 18 kft 26.1%, 0.5 kft 


5.7 Discussion of the simulation results 


An example of the effect of bridged taps under a hybrid mode of 
transmission can be identified by comparing Figs. 22 and 23 at the 64- 
kb/s rate or Figs. 24 and 25 at the 144-kb/s rate. Similarly, the effect 
of bridged taps in the TcM mode of transmission is identifiable by 
comparing Fig. 26 with 27, or 28 with 29 at the two data rates. 
Conversely, a hybrid and a TCM system can be compared at the two 
rates by comparing Fig. 22 against 26, or Fig. 23 against 27 for the 64- 
kb/s rate, and by comparing Fig. 24 against 28, or Fig. 25 against 29 for 
the 144-kb/s rate. 


5.7.1 An example of a comparison of a TCM system with a hybrid 
system 


The entire population of loops in the loop survey can carry bidirec- 
tional data at 64 or 144 kb/s up to 18 kft with the TcM system when 
the bridged taps are stripped. The eye closure in all the cases has been 
computed to be <42 percent. In the hybrid mode with 30-dB echo 


1038 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1982 


cancellation and with similar equalization, the failure rate (criterion: 
eye closure >40 percent) is expected to be 9 percent at 64 kb/s, when 
all the bridged taps are stripped, and climb up to 31.2 percent with the 
bridged taps intact. 

_ At 144 kb/s with the Tcm system, all the loops (no bridged taps) 
carry the bidirectional data with 40 percent or less eye closure, except 
one single loop which exhibits a 42-percent closure. Under similar 
conditions, the hybrid system with 30-dB echo cancellation yields a 
loop failure rate of about 8.4 percent. Bridged taps deteriorate the 
performance of both the Tcm and the hybrid systems. However, the 
TCM system suffers a 26.1-percent loop failure against 46.5 percent for 
the hybrid system with 30-dB echo cancellation. 

The next significant observation is that in the hybrid system at 64 
kb/s with all the bridged taps stripped, the data can be transmitted 
successfully on all loops up to 12 kft, and the failure rate is 8, 35, 61, 
76, 66, and 81 percent in each of the 1-kft bands up to 18 kft. At 144 
kb/s-data transmission on the loops with all the bridged taps stripped 
is successful up to about 11 kft. The failure rates for 12- to 18-kft bands 
have been computed as 15.4, 26.3, 25.8, 7.7, 52.4, 66.7, and 100 percent, 
respectively. 

Further, the effect of the bridged taps forces the limitations of the 
loop length to 5.7 kft at 64 kb/s in the hybrid mode and to about 2.1 
kft at 144 kb/s. At 64 kb/s the failure rates in each of the 1-kft bands 
between 5 to 18 kft are 1.3, 1.6, 8.33, 61.2, 71.7, 72.73, 69.3, 63.2, 51.6, 
73.1, 76.1, 66.6, and 81.25 percent, respectively. At 144-kb/s, only the 
1- and 2-kft band qualify with a 100-percent success for data transmis- 
sion, and loop failure with other sixteen 1-kft bands are 3.57, 1.45, 6.25, 
23.38, 35.96, 26.39, 82.09, 88.68, 70.9, 73.1, 79.47, 100, 92.31, 95.24, 100, 
and 100 percent, respectively. On the other hand, the TcM mode of 
transmission is not particularly sensitive to loop length but, instead, to 
the particular bridged-tap length. The hybrid system suffers twice 
because of (z) reflection of the transmitted signals caused by impedance 
mismatches, and (zz) reflections from the bridged taps in the direct 
transmission path. The situation becomes complicated because of the 
length, number, and the distribution of the bridged taps along the loop 
length. 


§.7.2 Comparison of results for loops without and with bridged taps 


Loop failure is particularly susceptible to the presence of bridged 
taps. In the hybrid system at 64 kb/s, the penalty is that approximately 
22 percent (from 9.0 to 31.2 percent) more loops become incapable of 
carrying the data. In the TcM system, about 2 percent (from 0.0 to 2 
percent) loops are excluded. In the hybrid system at 144-kb/s, the 
penalty is about 38 percent (from 8.4 to 46.5 percent) of the loops 
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because of the presence of the bridged taps, whereas the TCM system 
suffers a 26 percent (from 0.0 to 26.1 percent) increased loop failure. 
These results are summarized in Table VIII. The next observation is 
that the loop-failure percentage has very little to do with the shortest 
loop that fails. Stated alternatively, the failure is likely to be inde- 
pendent of the loop length. However, from further analysis the char- 
acteristics of the failed loops indicate that even one single-bridged tap 
of critical length and appropriate gauge located anywhere along its 
length can destroy the eye. It has been determined that the length of 
this bridged tap is such as to cause a delay of one (or an integer 
number of) pulse period (periods, to lesser extent), thus, resulting in a 
systematic obliteration of the eye from either direction. 

The former observation is further confirmed by the fact that the 
shortest loop length (Table VIII) for a 144-kb/s Tcm system is only 0.5 
kft, whereas the loop percentage failure is about 26 percent. Similar 
results for the hybrid system indicate that the shortest loop length is 
about 2.5 kft, and the percentage failure rate is about 46 percent. 


5.8 Summary of results and conclusions 


We have been able to obtain the overall performance of many 
systems under different terminal conditions. The results presented in 
this paper are based on an equalizer (Section VI). In the hybrid mode 
of data transmission, there is reason to conclude that deterioration 
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Fig. 20—Extraction of eye statistics from the eye diagram. 
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starts to become worse after the shortest loop length for failure is 
reached, whereas a similar conclusion is not evident in the TCM mode. 
In the former case, the eye deterioration is basically caused by (i) 
reflections, and (iz) delayed transmissions. While the signal strength 
starts to fall because of longer loop lengths, the reflected signal retains 
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Fig. 21—Eye thickness scatter plots for 80-kb/s data transmission in the TCM mode. 
In the set of six figures (a through f) the data at the s end of the loops are given. (a) Eye 


thickness (Y-axis) versus loop length in kft (X-axis). (b) Eye thickness versus equivalent 
loop length (expressed in kft of No. 22 awe cable). 
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its strength. The delayed transmission, which is of concern to both the 
hybrid and the TcM mode, can interfere with the pulse in the next time 
slot. In the TcM mode, the delayed transmission alone is the cause of 
loop performance degradation. 
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Fig. 21—(c) Eye thickness for loops with 1-, 2-, or 3-cable sections versus loop length 
in kft (numbers indicate number of sections). (d) Eye thickness for loops with 4-, 5-, or 
6-cable sections versus loop length in kft (numbers indicate number of sections). 


1042 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1982 


EYE OPENING NEGATIVE POSITIVE 


EYE HEIGHT 





9 12 15 18 
LOOP LENGTH IN KILOFEET 


Fig. 2i—(e) Eye thickness for loops with 7-, 8-, or =9-cable sections versus loop 
length in kft (numbers indicate number of sections). (f) Eye thickness and eye height for 
all loops versus loop length in kft. 
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Fig. 22—Eye data scatter plots for 64-kb/s hybrid transmission with 30-dB echo 
cancellation. Bridged taps are stripped from the loops. Subscriber side data are given. 
(a) Eye opening. (b) Eye thickness. 
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Fig. 23—Eye data scatter plots for 64-kb/s hybrid transmission with 30-dB echo 


cancellation. Bridged taps are left intact. Subscriber side data are given. (a) Eye opening. 
(b) Eye thickness. 
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Fig. 24—Eye data scatter plots for 144-kb/s hybrid transmission with 30-dB echo 


cancellation. Bridged taps are stripped from the loops. Subscriber side data are given. 
(a) Eye opening. (b) Eye thickness. 
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Fig. 25—Eye data scatter plots for 144-kb/s hybrid transmission with 30-dB echo 
cancellation. Bridged taps are left intact. Subscriber side data are depicted. (a) Eye 
opening. (b) Eye thickness. 
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Fig. 26—Eye data scatter plots for 64-kb/s TcM-mode transmission (actual line rate 
is 144 kb/s). Bridged taps are stripped from the loops. Subscriber side data are given. 
(a) Eye opening. (b) Eye thickness. 
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Fig. 27—Eye data scatter plots for 64-kb/s TCM-mode transmission (actual line rate 
is 144 kb/s). Bridged taps are left intact. Subscriber side data are given. (a) Eye opening. 
(b) Eye thickness. 
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Fig. 28—Eye data scatter plots for 144-kb/s TcM-mode transmission (actual line rate 
is 324 kb/s). Bridged taps are stripped from the loops. Subscriber side data are given. 
(a) Eye opening. (b) Eye thickness. 
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Fig. 29—Hye data scatter plots for 144-kb/s TcM-mode transmission (actual line rate 
is 324 kb/s). Bridged taps are left intact. Subscriber side data are given. (a) Eye opening. 
(b) Eye thickness. 


Vi. ANALYSIS, DESIGN OPTIMIZATION, AND EVALUATION OF 
EQUALIZERS 


6.1 Introduction 


Four equalizer designs are analyzed and evaluated for their appli- 
cation to data transmission in the range of 56 to 144 kb/s. The first 
design is for 144-kb/s transmission. The second design is specifically 
for a line rate* of 324 kb/s. The remaining two designs are intended 
for a range of Tc applications in the region of 126 (2.25 x 56) kb/s to 
324 (2.25 x 144) kb/s. Realizations of the designs are discussed and 


* The information rate is 144 kb/s, but the line rate for this design is enhanced to 324 
kb/s because of the Tc operation. , 
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their actual performances are computed by a series of simulation 
programs discussed in Sections II, III, and IV. To prevent cable 
mismatch effects (which are indeed present in a hybrid system) from 
overshadowing the influence of equalization, we present simulation 
results utilizing continuous unidirectional transmission, thus, focusing 
on the performance of the equalizer. The equalizers then can be utilized 
in either a hybrid or a TCM-mode system. The eye-opening scatter 
plots obtained by analyzing the system performance on actual loops 
taken from the loop survey are used as the ultimate criteria to judge 
the equalizer quality. 

A digital-system equalizer attempts to undo the effect of cable 
attenuation and restore the pulse shape at the receiver; however, pulse 
shape results from a composite of the amplitudes of many frequency 
components, their phase relationships, and their harmonic relation- 
ships. Hence, a perfect pulse restoration at all bit rates entails perfect 
responses at all frequencies. In essence, an inverse loop circuit consti- 
tutes a perfect equalizer circuit. However, loops, bit rates, and coding 
algorithms are each numerous and uncorrelated, thus, causing addi- 
tional elements of complexity in the design of equalizers. Even though 
the results presented here are for the bipolar code, it should be noted 
that for other codes, the region of spectral emphasis for matching 
cable loss and equalizer gain will be dictated by the bandwidth require- 
ment of the particular code. 


6.2 Three-section, 18,000-ft equalizer . 
6.2.1 Circuit configuration and representation 


Each of the three sections has a five-pole, five-zero circuit. The 
location of these poles and zeros are obtained by the solution of five 
simultaneous equations that are formulated to yield exact equalizations 
of 6,000 ft of No. 22 awe cable at five discrete frequencies (1, 2, 10, 50, 
and 100 kHz). Further, the transfer function of the Rc circuit is 
controlled by the gain of an operational amplifier, and this gain is 
adjusted by yield equalization of loops or loop sections under 6000 ft 
of cable. Thus, loops of any length under 18,000 ft can be equalized by 
cascading one, two, or three sections of this equalizer and appropriately 
adjusting the gains on the three sections of the equalizer. Variation in 
temperatures is converted to a variation in effective loop length, and 
the eye height is maintained by the gains of the three sections of the 
equalizer. 


6.2.2 Equalizer performance with composite cables 


The performance evaluation of this equalizer with composite cables 
is limited to a study of eye openings as the composition of the cables 
is changed gradually for an overall loop length of 18 kft. Two major 
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simulations are performed. In the first simulation, a loop comprising 
Nos. 22 and 24 awe cables with an overall length of 18 kft is chosen. 
At one extreme, the loop is totally composed of No. 22 awe cable; 
conversely, at the other extreme, the loop consists of No. 24 awG cable. 
These two cases are presented at the left and right sides of the X-axis 
in Fig. 30. As the distance along the X-axis is increased, the No. 22 
AWG cable is shortened and the length of No. 24 Awe cable is increased. 
The eye openings are plotted on the Y-axis. The two eyes generated 
for alternate bipolar coding yield two data points for each loop com- 
position. 

Similar results for Nos. 19 and 26 awe cable compositions are shown 
in Fig. 31. The simulations are carried out at 144 kb/s. The perform- 
ance of the equalizer and the quality of the filter are both reflected by 
the eye opening. The first filter at the transmitting end has the 3-dB 
point located at half the bit rate. The second identical filter is located 
at the receiver end. This pair of filters essentially yields a pulse shape, 
with the amplitude rising from zero and steadily reaching a peak and 
diminishing to zero, one pulse period after its peak. 

In Figs. 30 and 31, a pair of points along each vertical line represents 
an independent simulation for the appropriate composition of cables. 
Eye diagrams are internally generated and the seven eye statistics 
(Section 5.2) are extracted. For each loop, two dots representing the 
positive and negative eye opening are plotted. From Figs. 30 and 31, it 
can be expected that the equalizer compensates 18 kft of Nos. 19, 22, 
24, and 26 awG cables with 70, 80, 70, and 0 percent eye openings, 
respectively. With only No. 26 Awe loops, independent simulations 
indicate that this equalizer provides an eye opening of about 40 percent 
with 12 kft loops. It is also interesting to note that the eye openings in 
either direction of transmission are roughly the same because the 
effects of reflections (as they occur in a hybrid system) are eliminated, 
thus, emphasizing only equalizer and filter performances. Proper filter 
performance is assured by a 100-percent eye opening at zero cable 
length. The equalizer performance with the loops in the data base may 
be examined by studying the scatter plots of Figs. 32a and 32b. The 
eye openings (Fig. 32a) and the eye thickness (Fig. 32b) are generated 
by the transmission of random data with bipolar codes. 


6.3 Four-section, 20,000-ft equalizer 
6.3.1 Circuit design 


The range and spectrum of equalization are both enhanced for this 
design. As opposed to the former design, a “staged turn-on” algorithm 
has been used for more effective equalization. In this mode, the four 
sections are turned on, one by one, as the range of each section is 
exhausted by the loop attentuation. Further, the equalization is also 
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forced out to a frequency of 250 kHz, thus, increasing the total 
maximum gain from 60 dB for the former design, to about 72 dB. Each 
section of this six-pole, six-zero equalizer has exact equalization at 1, 
2, 10, 50, 100, and 250 kHz for 5000 ft of No. 22 Awe loop. 


6.3.2 Equalization data for loop attenuations 


The performance of this equalizer is studied at three frequencies. 
The attenuation of the 831, 1973 survey loops (Section III), in the 
_ truncated data base are computed at frequencies of 56, 144, and 324 

kHz. Next, the equalization gain for each of the loops at the three 
corresponding frequencies is also computed. As a next step, the equiv- 
alent length of the composite loops, in terms of No. 22 awe cable, is 
calculated and the loop attenuations are now plotted in Figs. 33a, 34a, 
and 35a for these three frequencies at 56, 144, and 324 kHz, respec- 
tively. These scatter plots represent approximate straight lines since 
the gauge-length conversions (see Section 3.4) bring the individual 
loop attenuation points parallel to the X-axis to roughly fall along the 
No. 22 awe cable attenuation line. Finally, the equalizer gains for all 
the loops are also plotted in Figs. 33b, 34b, and 35b. For perfect 
equalization, the lines in Figs. 33a and 33b, 34a and 34b, and 35a and 
35b should be collinear. The imperfection of equalization is indicated 
by the noncollinearity of these lines or, for that matter, the noncon- 
currence of these individual points. 


6.3.3 The 1973 loop survey eye diagram performance 


Simulations at 324 kb/s (the Tcm mode rate corresponding to 144- 
kb/s bidirectional hybrid rate) and at 126 kb/s (the TcmM mode rate 
corresponding to 56-kb/s bidirectional hybrid rate) are presented to 
evaluate the performance of this equalizer. In Fig. 36, the eye openings 
are shown as a scatter plot of the 1973 survey loops when transmitting 
data at 324 kb/s in the TcM mode. These results correspond to co-to- 
s data transmission. Similar results generated for S-to-co transmission 
are not presented. Figure 37 shows the eye-opening data when the 
system is excited with 126-kb/s TcM mode. 


6.4 Limited-range, single-pole equalizer 


In the previous two designs the equalizer gain exactly compensates 
the cable loss at specific frequencies (five for the first, six for the 
second design) and at specific cable lengths (6 kft of No. 22 awe and 
5 kft of No. 22 awe cables). Variations because of length are accom- 
plished by changing the gain control on the operational amplifiers. 
Equalizers reported in Sections 6.4 and 6.5 are designed by broadly 
matching the shape of the cable loss with frequency against the gain 
of the equalizer in the frequency domain. Overall gain changed to 
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accommodate different cable lengths are adjusted by changing diode 
resistances. This, in turn, adjusts the gain of the operational amplifier 
without drastically changing the frequency characteristics. A single 
pole and an adjustable zero-type of transfer function is used to generate 
a suitable equalization range over a 6- to 36-dB cable loss. When these 
poles and zero’s are redistributed, a corresponding range of equalization 
may be obtained for these lower frequency applications. The applica- 
tion considered here is for a 324-kb/s TCM-mode data transmission. 
Cable attenuations and equalizer gains at the half-baud rate (162 kHz) 
are both plotted in Fig. 38. The dotted lines indicate the cable atten- 
uations of the Nos. 19, 22, 24, and 26 Awe cables. The full lines indicate 
the corresponding equalizer gain. The chief limiting factor is the range 
of equalization. The equalizer saturating at about 36 dB cannot com- 
pensate for cable losses greater than about 10.7 kft of No. 26 awe at 
162 kHz. Our study of the loop plant indicates that a considerable 
fraction of loops extends beyond this distance. Hence, the design of 
this particular type of equalizer has been abandoned for the 324-kb/s 
TCM mode of bidirectional data transmission. A limited study of eye 
opening data has indicated that only 30 to 40% of the loops would 
carry bidirectional data with a 60-percent or better eye opening at 324- 
kb/s TCM-mode data transmission. 


6.5 Enhanced-range, wider-frequency, optimized equalizer 


The design philosophy for this equalizer is the same as that for the 
former equalizer; However, two cascaded, shaped gain amplifiers are 
used in this case. The first has a low-frequency zero and two higher- 
frequency poles, while the second has a low-frequency zero, another 
right-half-plane zero, and a pair of complex poles. However, optimiza- 
tion by the incremental displacement of poles and zeros is essential to 
follow the loop attenuation at frequencies of interest. 


6.5.1 Optimization strategy 


An optimization approach in both the frequency and distance do- 
mains is simultaneously attempted to match the equalizer gain against 
the cable loss. First, half-baud rate frequency cable attenuation lines 
are generated for Nos. 19, 22, 24, and 26 awe cables. Then the gauge 
conversion factors (see Section 3.4) at this frequency are used as a 
basis of all the gauge conversions. The maximum equivalent loop 
length (as extracted from the loop survey) is then determined to 
correspond to about 16.7 kft of No. 26 AwcG cable. Next, No. 26 awG 
cable losses at 5, 10, 50, 100, 150, 200, and 250 kHz are generated by 
the primary characteristics of the No. 26 AwcG cable (see dashed lines 
in Figs. 39a and 39b). The poles and zeros of the equalizer are given 
incremental changes such that the equalizer gain and the cable losses 
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are almost collinear at about the half-baud rate frequency over the 
span of one-half to two-thirds of maximum cable length (see Figs. 39a 
and 39b). The first step of optimization is more or less easily achieved; 
however, in the second step of shifting the R’s and C’s of the equalizer 
circuit such that the noncollinearity of the equalizer gain and the cable 
loss at the low-frequency end of the spectrum has to be weighed 
against the noncollinearity of the two at the higher end of the spectrum. 
The optimization becomes complicated by the fact that both the low- 
and high-frequency noncollinearities themselves display crossovers in 
the distance domain. Further, the crossover at the low-frequency end 
has an intersection going from a lower gain to a higher gain as the 
length of the cable is increased. On the other hand, the crossover at 
the higher frequency follows an opposite type of characteristic chang- 
ing from higher gain to lower gain as the distance increases. Thus, a 
large number of trials are necessary to achieve a satisfactory compro- 
mise. 

The effect of this optimization implies one distinct effect on the 
single-pulse response. At lower cable lengths, there is a distinct over- 
shoot because of the higher frequency over equalization and the lower 
frequency under equalization. Conversely, at longer cable lengths there 
is a tail left behind in the single-pulse response because of lower 
frequency over equalization and higher frequency under equalization. 
But there is a certain range of lengths at which the single-pulse 
response approaches perfection. We have trapped and exploited this 
perfection to be about two-thirds the maximum cable length. Thus, 
the scatter plot generated by this type of equalizer shows nearly 100 
percent eye openings (to be discussed next) in the central one-third 
span of the loops. This type of equalizer behavior is a distinct contrast 
to those discussed in Sections 6.2 and 6.3. Further, it is consistently 
good over a range of 126 to 324 kb/s. 


6.5.2 Selected loop performance 


At 55 ft of No. 24 awe cable, the eye diagram is imperfect. The cable 
distortion component can be ruled out at this minimal length. How- 
ever, in our effort to reduce the circuit complexity, we have eliminated 
the transmit filter and optimized the pulse width at about 40 percent 
of the pulse period.* The harmonics of this pulse shape are affected by 
the receiver filter and by the very slight effect of the equalizer. This 
has been understood to be the reason for the slight imperfection of the 
eye diagram shown in Fig. 40 at 324-kb/s operation. 


* The pulse width for the best eye opening was found to exhibit broad insensitivity 
in the range of 35 to 42 percent of the pulse period. Hence, for all the simulations, we 
have held the transmit pulse width at 40 percent and retained a half-baud rate 3 dB 
down four-section Bessel filter at the receiver end of the loop. 
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A typical loop response around 6 kft is presented in Fig. 41 at 324 
kb/s. In Fig. 41, a seven-section loop with an overall length of 6093 ft 
is chosen at 60°F. The eye diagram” is relatively the same from both 
directions of transmission. 

Two loops around 12 kft show the response of the equalizer. In Fig. 
42, the eye closure is about 18.7 percent at 324 kb/s. This loop is also 
composed of seven sections and is the worst loop in that region of 
overall loop lengths. The best system performance also takes place 
around this length, with a single-section loop with an 11,998-ft No. 26 
AWG cable as shown in Fig. 43. The eye closure is only 3 percent, and 
the response is consistent with the design of the equalizer as discussed 
in Section 6.5.1. 

In Fig. 44, the worst loop (17,314 ft loop) response is shown. The eye 
closures are about 43 and 42.3 percent for the two sides of transmission, 
and one can visualize the presence of long tails left behind single pulses 
in these two eye diagrams. 


6.5.3 The 1973 loop survey eye diagram performance 


The loops in the truncated loop survey data base have been simu- 
lated at four different Tcm frequencies (126, 162, 216, and 324 kb/s to 
correspond to full-duplex bidirectional rates of 56, 72, 96, and 144 kb/ 
s). The scatter plots of eye openings are plotted in Figs. 45, 46, 47, and 
48. All these scatter plots exhibit a pattern yielding the best eye 
openings in the area of the 8 to 13 kft length rather than at about zero 
length. This particular performance is because of the equalizer opti- 
mization strategy. 


6.6 Conclusions 


The design of the equalizer has a profound effect on the capability 
of the loops to transmit the digital data. Here, we have compared the 
various design strategies and studied their effects. An equalizer design 
using the presently available configurations was optimized for a center 
frequency of 81 kHz to correspond to 162-kb/s Tcm mode. However, a 
reoptimization of this design at 126 and 324 kb/s was deemed unnec- 
essary because of its satisfactory performance. The optimization pro- 
cedure described here should prove valuable for the design and imple- 
mentation of any equalizer configuration, provided its maximum range 
is adequate for the attentuation expected from the longest of the loops. 
Effects of cable-bridged taps, impulse noise, and crosstalk on the error 
rate of the data transmission are not included in this study. If these 
effects limit the maximum range of equalization, then any of the 


* The eye closure is 21.1 percent for the co-to-s eye diagram and 21.4 percent for the 
s-to-co eye diagram. 
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proposed designs may be effectively used by employing the optimiza- 
tion strategy presented in this paper. 

All the simulation results presented here are for the bipolar code. 
However, it can be visualized that the design philosophy and the 
optimization techniques are universal to the extent that the region of 
spectral emphasis (Figs. 39a and 39b) will be dictated by the bandwidth 
requirements of the particular code. 
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Fig. 30—Eye openings with Nos. 22 and 24 awG composite cable loops having a total 
length of 18 kft (144 kb/s). 
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Fig. 31—Eye openings with Nos. 19 and 26 AwG composite cable loops having a total 
length of 18 kft (144 kb/s). 
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Fig. 32—Eye data scatter plot obtained by the simulation of 1973 survey loops with 
the three-section, 18,000-ft equalizer at 144 kb/s. (a) Eye opening. (b) Eye thickness. 
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0 3 6 9 12 15 18 21 24 27 
LOOP LENGTH IN KILOFEET 
Fig. 33—Comparison of loop attenuations and equalizer gains for 1973 loop survey at 
56 kHz plotted against the loop length expressed in equivalent kft of No. 22 awe cable. 
Scale on the Y-axis is 5 dB/division, and on the X-axis it is 1.5 kft/division. (a) Loop 
attenuation. (b) Equalizer gain. 
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Fig. 34—Comparison of loop attenuations and equalizer gains for various loops at 144 
kHz. The Y-axis scale is 6 dB/division. (a) Loop attenuation. (b) Equalizer gain. 
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Fig. 35—Comparison of loop attenuations and equalizer gains for various loops at 324 
kHz. The Y-axis scale is 8 dB/division. (a) Loop attenuation. (b) Equalizer gain. 
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Fig. 36-—Eye opening scatter plot at 324 kb/s with 20,000-ft, four-section equalizer in 
the Tcm mode of transmission. Subscriber side data are given. 
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Fig. 37—EHye opening scatter plot at 126 kb/s with 20,000-ft, four-section equalizer in 
the Tcm mode of transmission. Subscriber side data are given. 
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Fig. 38—Cable losses (dotted lines) and equalizer gains of a limited range, single-pole 
equalizer plotted for a 324-kb/s TcM design. Loop length (1 ay anions is plotted on 
the X-axis and loop loss (7 dB/division) is plotted on the Y-axis 
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Fig. 39—-Comparison of cable loss (dashed lines) and equalizer gains of the enhanced 
range, wider frequency, optimized equalizer at different frequencies for the No. 26 awG 
cable. The Y-axis is 8 dB/division. (a) Plot against the actual loop length (the X-axis 
scale is 1 kft/division). (b) Plot against the equivalent loop length in terms of No. 22 
AWG cable (the X-axis scale is 2 kft/division). 
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Fig. 40—Eye diagram showing the effect of finite-pulse width with 55 ft of No. 24 


AWG cable at 324 kb/s. 
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Fig. 41—Seven-section loop (88 ft 
AwG; 310 ft, No. 22 awa; 19 ft, No. 24 awa, 


response at 324-kb/s TcM excitation. 
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Fig. 42—Seven-section (3450 ft, No. 26 awa; 261 ft, No. 24 awa; 50 . No. 26 awG; 
313 ft, No. 22 awa; 2477 ft, No. 24 AWG; 5329 ft, No. 22 awe; and 200 ft, No. 24 awa) 


loop response at 324-kb/s TcM excitation. 
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Fig. 43—Eye diagram for a single-section loop consisting of 11,998 ft of No. 26 awcG 


cable at 324 kb/s. 
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Fig. 44—Worst eye af for an eight-section (7789 ft, No. 26 awe; 2998 ft, No. 24 
AWG; 2564 ft, No. 26 awa; 25 ft, No. 24 awa; 957 ft, No. 26 Awa; 2120 ft, No. 24 awa; 141 


ft, No. 26 awG; and 720 ft, No. 24 awe), 17,314 ft loop at 324 kb/s. 
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Fig. 45—Eye opening scatter plot for the 1973 survey loops at 126 kb/s Tcm mode. 


(Effective rate is 56 kb/s.) 
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Fig. 446—Eye opening scatter plot for the 1973 survey loops at 162 kb/s TCM mode. 
(Effective rate is 72 kb/s.) 


4. # 
Sp ehenes Tee el . 
Pe 


POSITIVE 


NEGATIVE 


cls foibeeaet Sg Io eee 


0 3 6 9 12 15 18 
LOOP LENGTH IN KILOFEET 





Fig. 47—Eye opening scatter plot for the 1973 survey loops at 216 kb/s TcmM mode. 
(Effective rate is 96 kb/s.) 
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Fig. 48—Eye opening scatter plot for the 1973 survey loops at 324 kb/s. (Effective 
rate is 144 kb/s.) 


Vil. CONCLUDING REMARKS 


The limit-cycle approach used in the paper and discussed in the 
Appendixes provides a broad spectrum of results important for the 
overall design of the digital transmission system. The statistical anal- 
ysis of the loop plant provides a basis for the design of terminal 
circuitry, especially for the value of the terminating impedances for a 
range of transmission rates, and it also indicates the extent of mis- 
match. For the hybrid system design, this mismatch is the initial basis 
to choose the quality and extent of echo cancellation. The time-domain 
response in Section IV is the final basis to study the system perform- 
ance. The better systems yield a satisfactory eye opening even with 
the worst configuration of loop studied in Section III. Next, the time- 
domain simulations over the entire statistically significant population 
of the loops (Section III) has been cross-compared for the two com- 
peting systems (the hybrid and the Tcm) at various transmission rates. 
The summary of these results under two conditions (with bridged taps 
and without bridged taps) are presented in Section 5.7. Finally, the 
influence of the equalizer design is clearly indicated in Section VI. It 
is here that we propose the approach for a systematic design of the 
equalizer for data transmission. The computer-aided design technique 
is outlined and the influence of the design strategy is clearly delineated. 
These designs can be modified and matched with the typical loop 
environment for the other national telephone networks. Within the 
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Bell System, the designs have been tailored to the loop environment 
at data rates between 56 to 324 kb/s. 

System design rules vary drastically with the loop environment. 
Hybrid systems are ideally suited where loop discontinuities are ab- 
sent. In this case, a low-quality echo canceler can be complimented 
by an excellent matching circuit. The time-compression mode is suit- 
able for a widely varying loop environment, especially if the extent of 
the echo cancellation is limited to 30 to 35 dB. Further in the Tc, the 
extent and persistence of echos at the end of a burst also influences 
the guard-time duration. Such simulations (even though not reported 
here) have been conducted” and some loops (not essentially those 
with high-loop losses) have longer time constants for these echos. 
Acceptable loop configurations in the presence of bridged taps are also 
critically influenced by the design of the equalizer and filters. For 
example, the nature of balance that the designer imposes between 
high-frequency and low-frequency gains (depending on the distribution 
of the poles and zeros in the equalizer and filters) has a wide ranging 
influence on the allowable bridged tap(s). Hence, to permit a wide 
variety of loop acceptance, it is desirable to adjust the equalizer such 
that the fixed design of the equalizer totally undoes the effect of an 
average loop with an average configuration of bridged taps. Deviations 
(because of loop variations) from this perfect compensation causes an 
error which distributes the eye closure around about a near open eye 
diagram. It is here that the designer can obtain an acceptable eye 
opening (about 60 percent) even with the worst loops in the data base 
by choosing the optimal circuit components. Such designs have also 
been successfully obtained yielding very low loop failure rates in the 
TCM mode of transmission. 

Further, it is also possible to computationally superpose two inde- 
pendent limit-cycle conditions (one for impulse noise and one for data 
transmission) to study the effects’® of impulse noise events on data 
transmission. The simulations graphically show the influence on the 
wave shape of the received data and the eye diagram it generates. To 
study the effects of a typical event on a cross section of the loops in 
the loop survey data base, we have generated scatter plots of s/n’s 
because of the imperfection of the transmission system with and 
without the events. High-loss loops suffer more. The nature and 
harmonic content of the event both influence the error rate. Typically 
recorded events do not display drastic influence on loops at 64 kb/s 
(in the TcM mode of transmission) until they reach about a 30-dB loss 
at 72 kHz. The influence starts to become significant with loops of 40- 
dB loss—the s/n reduces from 7 to about 3 dB and loops over 40 dB 
tend to become very unpredictable. 
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APPENDIX A 
Formulations and Component Representation 
A.1 Excitation functions 

A.1.1 Delta function pulse shape 


The mth harmonic of a single-delta function repeating indefinitely, 
trapped in a limit cycle every T seconds, may be represented as 


A(p) = ai + ase?" + aye??? + aye PT + +, (1) 


where p = 27im/T and a, = 1 denotes a positive delta function, a; = 
0 denotes the absence and a; = —1 denotes the negative-delta function. 
Further 

ay 


A(p) er (2) 


The mth harmonic of K such delta-function pulses displaced from the 
first pulses by 7, 27, 37, ---, kt, --+, (K — 1)r seconds can be 
generalized as 

k=K ape DPT 


B(p) = ¥ 


pr = e7P") @) 


A.1.2 Finite pulse-width square-wave pulse shape 


When a single pulse of width 6 repeats infinitely, trapped in a limit 
cycle repeating every T seconds, then it may be represented as a 
summation of one positive unit function (repeating infinitely every T 
seconds) and one negative unit function (also repeating infinitely every 
T seconds) but displaced by 6 seconds. Hence, the algebraic represen- 
tation of this pulse becomes 


A(p) = = [(1 + PT 4-27 + PT...) 


a e (1 ae ert +. e PT 4 e PT B -)] 
ai(1 — e*) 
ae ERE (4) 
p(l — e?”) 
where p = 27im/T and a; (as in Appendix A, section A.1.1) denotes 
presence or absence of the pulse. For alternate bipolar pulses, the 
value of a becomes +1, 0 or —1, and for multilevel pulse simulations, 
a can become g, (g — 1) --- 1,0, —1, --- (¢ — 1), — q, and this would 
generate a (2g + 1) level eye diagram. 
The effect of K such pulses a; --- a, displaced from the first pulse 
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Fig. 49—Loop without bridged taps. 


by 1, 27, --- kr, --» (K — 1)r can be written as 


eK aq, (1 — e?*) e Ur 


B = 
(P)= Tae) 


‘APPENDIX B 
Loop Calculations 
B.1 Loops without bridged taps 


In Fig. 49, a loop with four gauges and three temperatures is shown. 
Fifty-six individual sections are created, each having its own gauge and 
its own temperature. For these sections, the individual ABCD param- 
eters are generated. Composite loop parameters are obtained by matrix 
multiplication* at each harmonic frequency of the excitation function 
[See eqs. (3) and (5).] 


B.2 Loops with bridged taps 


In Fig. 50, a loop with a sections and b bridged taps is shown. Each 
bridged tap has c sections. This would be the most general represen- 
tation of any loop without load coils. Only one single temperature is 
permitted for the entire loop and its bridged taps, to keep the com- 


* Each term is complex and should be treated as such. 
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Fig. 50—Loop with bridged taps. 


plexity of the representation and to conserve the core space require- 
ments for the computer. Individual ABCD parameters for the individ- 
ual loop section is first generated and stored. When a bridged tap is 
encountered, its own ABCD parameters are obtained by normal ABCD 
matrix multiplication for all the sections in that particular bridged tap. 
The loop at this point sees the bridged tap as a parallel connection, 
and the bridged tap ABCD is obtained by making A and D unity, B 
zero, and C as (Yo tanhT’), where Yo is the admittance (i.e., the square 
root of CD divided by AB) prior to conversion, and I‘ is propagation 
constant for the entire bridged tap (i.e., the complex log of half the 
sum of A, D, B/Zo, and ZoC, where Zo = 1/Yo) prior to conversion. 

Thus, the matrix reduction is continued in the presence of the 
bridged taps, and the net ABCD parameters for the entire loop are 
obtained. 
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APPENDIX C 
Conversion to Time Domain 


The system’s spectral-domain response can be written as 


F(p) = B(p) X §(p), (6) 


where {(p) represents the system’s transfer function for the mth 
harmonic and p is defined 27im/T. The inverse Fourier transform of 
F(p) is 


1 + 
f(t) = a B(p)e"'S(p)dp 
2at J_,., 


1 +i po Ka pe YP" &( ped 





Qmi J, kel 1-¢?" 
ais +2c7im 

=) sum of residues at p = # | (7) 
k=1 


For delta functions, we have B( p) defined by eq. (3), and each residue 
at the mth harmonic can now be evaluated as 


(» a) k=K aze~ "YP" ¢( pe?! 
k=1 


[a—1]m=m = lim | T = epr . (8) 


pr 





By differentiating the numerator and denominator separately and 
by L’Ho6pital’s rule, we have 


k=K e7 F-Dpt pt 
[a-1|m-m = >} ee (9) 
k=1 


since e”? — 1 as p> 27im/T. The summation of residues leads to the 
time-domain response 





_(k-1)27im_ | f Qorim \ 2z7ime 
ae Kx ¢ T eT 


m=+M k=K 


Way 2s 2) 


m=—M k=1 T 


(10) 


In the earlier computations, M is limited to 64, K is 48. In the more 
recent computations, M and K have ranged between 64-1200, and 48- 
480, respectively. 
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